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DB 


ID 
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AAB16955 


Human IgGl Fc prot 
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1238 


100. 


.0 


228 
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AAY96529 


Human IgGl Fc chai 
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100. 


.0 
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100. 


.0 
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Human IgGl Fc . Ho 
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.0 
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.0 
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Human immunoglobul 
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100. 


.0 
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AAU73018 
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1238 


100. 


.0 
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23 


AAG66012 


Human immunoglobul 
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1238 


100. 


.0 


228 


23 


ABB04279 


Human IgGl Fc doma 


10 


1238 


100. 


.0 
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24 


ABJ38267 


Human IgGl Fc prot 


11 


1238 


100, 


.0 


243 


21 


AAB17957 


Fc-MMP inhibitor f 


12 


1238 


100. 


. 0 


243 


23 


ABB73425 


Fc-MMP inhibitor f 


13 


1238 


100. 


.0 


247 


21 


AAB16958 


Fc-TMP protein seq 


14 


1238 


100. 


.0 


247 


23 


ABB73411 


Fc-TPO mimetic pep 


15 


1238 


100, 


.0 
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21 


AAB17951 


Fc-TNF-alpha inhib 
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100. 
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Fc-IL-1 antagonist 
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100, 


.0 
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100, 
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Fc-interleukin 1 ( 
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100. 
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21 


AAB17955 


Fc-VEGF antagonist 


20 
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100. 


. 0 
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23 
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Fc-VEGF antagonist 
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100. 
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Fc-EMP protein seq 


22 


1238 


100. 


.0 
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23 


ABB73415 


Fc-EPO mimetic pep 
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100. 


.0 


268 


21 


AAB16959 


Fc-TMP-TMP protein 
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1238 


100. 


.0 


268 


23 


ABB73412 


Fc-TMP-TMP amino a 
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1238 


100. 


.0 


269 


21 


AAY96531 


Human IgGl Fc TMP 
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1238 


100. 


.0 


277 


21 


AAB16967 


Fc-EMP-EMP protein 
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1238 


100. 


.0 


277 


23 


ABB73418 


Fc-EMP-EMP nucleic 
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1238 


100. 


.0 
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23 


AAU8 1169 


Echistatin/IgG Fc 
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100. 


,0 
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Recombinant human 
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100. 
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100. 


. 0 
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.0 
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22 
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JO 
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Human IgG Fc fragm 
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AAY01372 
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TALL-1 inhibitory 
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.7 


248 


24 
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1234 


99. 


1 


248 


24 
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99. 
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252 


24 


ABJ38335 


TALL-1 inhibitory 
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ABJ38336 


TALL-1 inhibitory 
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, 7 


252 


24 


ABJ38337 


TALL-1 inhibitory 
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99. 


7 


252 


24 


ABJ38338 


TALL-1 inhibitory 
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99. 


7 


252 


24 


ABJ38339 


TALL-1 inhibitory 


43 
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99. 


7 


252 


24 


ABJ3 834 0 


TALL-1 inhibitory 
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99. 


7 


252 


24 
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TALL-1 inhibitory 


45 
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99. 


7 


252 


24 


ABJ38342 


TALL-1 inhibitory 



ALIGNMENTS 



RESULT 1 
AAB16955 

ID AAB16955 standard; Protein; 228 AA. 
XX 

AC AAB16955; 
XX 

DT 31-OCT-2000 (first entry) 
XX 

DE Human IgGl Fc protein sequence SEQ ID NO: 2. 
XX 

KW Modified peptide; therapeutic agent; fusion; Fc domain; cancer; 

KW autoimmune disease; cytostatic; antiasthmatic; thrombolytic; VEGF; 

KW immunosuppressive; EPO; TPO; CTLA4 ; mimetic; IL-1; TNF; antagonist ; 

KW MMP; inhibitor; erythropoietin; thrombopoietin; interleukin 1; 

KW cytotoxic T cell lymphocyte antigen 4; tumour necrosis factor; 

KW vascular endothelial growth factor; matrix metal loproteinase; 

KW asthma; thrombosis; pharmaceutical. 
XX 

OS Homo sapiens . 
XX 

PN WO200024782-A2. 
XX 

PD 04-MAY-2000. 
XX 

PF 25-OCT-1999; 99WO-US25044 . 
XX 

PR 23-OCT-1998; 98US-0105371 . 

PR 22-OCT-1999; 99US-0428082 . 
XX 

PA (AMGE-) AMGEN INC. 
XX 

PI Feige U, Liu C, Cheetham J # Boone TC; 
XX 

DR WPI; 2000-350702/30. 

DR N-PSDB; AAA69443 . 
XX 

PT Novel composition of matter comprising an Fc domain and 

PT pharmacologically active peptides, useful for treating cancer and 

PT autoimmune diseases - 

XX 

PS Claim 7; Page 176-177; 608pp; English. 
XX 

CC The present invention describes composition of matter (I) comprising an 

CC Fc domain, pharmacologically active peptides, and linkers. Where (I) is: 

CC (Xl)a-Fl- (X2)b, where: Fl = an Fc domain; XI and X2 = are each 

CC independently selected from -(LI) c-Pl, - (LI) c-Pl- {L2)d-P2, 

CC - (Ll)c-Pl- (L2)d-P2- (L3)e-P A 3, or - (LI) c-Pl- (L2 ) d-P2- (L3) e-P3- (L4) f -P4 

CC where PI, P2 , P3, and P4 = are each independently sequences of 

CC pharmacologically active peptides; LI, L2, L3, and L4 = are each 

CC independently linkers; and a, b, c, d, e, and f = are each independently 

CC 0 or 1, provided that at least 1 of a and b is 1. The composition can 

CC have cytostatic, antiasthmatic, thrombolytic and immunosuppressive 

CC activities. DNAs , vectors and host cells from the present invention can 

CC be used for producing pharmaceutical compositions. The compositions are 

CC useful for treating cancer, asthma, thrombosis, or autoimmune diseases. 

CC The use of an Fc domain (rather than a Fab domain) can provide a longer 

CC half-life or incorporate functions such as Fc receptor binding, protein 



CC A binding, complement fixation, and possibly placental transfer. AAA69443 

CC to AAA69526 and AAB16955 to AAB18003 represent nucleotide and amino acid 

CC sequences used in the exemplification of the present invention. 
XX 

SQ Sequence 228 AA; 

Query Match 100.0%; Score 1238; DB 21; Length 228; 

Best Local Similarity 100.0%; Pred. No. 1.2e-91; 

Matches 228; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 
Qy 1 MDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCWVDVSHEDPEVKFNWYV 60 

1 1 1 1 1 1 1 1 1 1 1 1 1 i I M 1 1 1 i 1 1 1 1 M 1 1 II 1 1 1 1 ' I I II 1 1 1 i II 1 1 1 1 1 1 II 1 1 1 1 ' 

Db 1 MDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCWA^DVSHEDPEVKFNWYV 60 

Qy 61 DGVEVHNAKTKPREEQYNSTYRWSVLTVLHQDWLNGKEYKCKVSNKALPAPI EKTI SKA 120 

IMIMMIIIIIIMIIIIIIMIIIIIIIIIIIIIIIMIIIIIMIMIIIIMIII 

Db 61 DGVE VHNAKTKPREEQYNSTYRWS VXiTVLHQDWLNGKEYKCKVSNKALPAP I EKTI SKA 120 

Qy 121 KGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLD 18 0 

I I I I I I I I I I I I I I I I I M I I II I II I I I I I II I I I I I I I I I I I II I I I I I I II II I II I 
Db 121 KGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLD 180 

Qy 181 SDGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 228 

I I I I I I I I I I M . I I I I I I I I II I I I I I I I I I I I I I I I i I I I I I I 
Db 181 SDGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 228 

RESULT 2 
AAY96529 

ID AAY96529 standard; Protein; 228 AA. 
XX 

AC AAY96529; 
XX 

DT 04-SEP-2000 (first entry) 
XX 

DE Human IgGl Fc chain. 
XX 
KW 
KW 
KW 
XX 

OS Homo sapiens. 
XX 

PN WO200024770-A2. 
XX 

PD 04-MAY-2000. 
XX 

PF 22-OCT-1999; 99WO-US24 834 . 
XX 

PR 23-OCT-1998; 98US- 0105348 . 
XX 

PA (AMGE-) AMGEN INC. 
XX 

PI Liu C, Feige U, Cheetham J; 
XX 

DR WPI; 2000-365108/31. 
DR N-PSDB; AAA29220. 



Immunoglobulin; IgGl; Fc; thrombopoietin; mimetic; TMP; TPO; platelet; 
megakaryocyte; production; anti-human immunodeficiency virus; anti-HIV; 
anti-anaemic; dermatological ; immunosuppressive; ant i- inflammatory. 



XX 

PT Thrombopoietic peptides which activate mpl receptors and increase the 

PT production of platelets or platelet precursors, useful for treatment of 

PT diseases which involve thombocytopenia 
XX 

PS Disclosure; Page 76-77; 91pp; English. 
XX 

CC A compound which binds to an mpl receptor comprising a thrombopoietin 

CC mimetic peptide (TMP) dimer joined by a linker [TMP_1- (L_l) __nTMP_2] , 

CC is new. TMP_1 and TMP_2 are amino acid sequences varying from at least 

CC 10 to 14 residues in length comprising X_2-X_l_0, X_2-X_l_l, X_2-X_l_2, 

CC X_2-X_l_3, X_2-X_l_4, X_1-X__1_0, X_l-X_l_l, X_l-X_l_2, X_l-X_l_3 / and 

CC X_l-X_l_4 . X_l = I, A, V ( L, S or R; XJ2 = E, D, K or V; X_3 - G or A; 

CC X_4 = P; X_5 = T or S; X__6 = L, I, V, A or F; X_7 = R or K; X_8 = Q, N, 

CC or E; X_9 = W, Y or F; X_1_0 = L, I , V, A, F, M, or K; X_l_l = A, I, V, 

CC L, F, S, T, K, H, or E; X_l_2 = A, I, V, L, F, G, S, or Q; X_l_3 - R, K, 

CC T, V, N, Q or G; X_l_4 = A, I, V, L, F, T, R, E, or G; L_l - linker 

CC comprising 1 to 20 amino acids; and n = 0 or 1 . The compounds bind to and 

CC activate the c-Mpl receptor which mediates the activity of endogenous 

CC thrombopoietin. The TMPs are useful for increasing the production of 

CC platelets or platelet precursors (e.g. megakaryocytes) in a mammal, which 

CC is useful for treatment of diseases which involve thombocytopenia, e.g. 

CC aplastic anaemia, immune thrombocytopenia (ITP) , human immunodeficiency 

CC virus associated ITP, and systemic lupus erythematosus. 
XX 

SQ Sequence 228 AA; 



Query Match 100.0%; Score 1238; DB 21; Length 228; 

Best Local Similarity 100.0%; Pred. No. 1.2e-91; 

Matches 228; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 
Qy 1 MDKTHTCPPCPAPELLGGPSVFLFPPKPKDTIJyiISRTPEVTCVVVDVSHEDPEVKFNWYV 60 

IIIIIMIIIIIIIMIIIIIIMIIMIIIIIIIIIIMIIIIIIMIIIIIIMIIII 

Db 1 MDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVWDVSHEDPEVKFNWYV 60 

Qy 61 DGVEVHNAKTKPREEQYNSTYRWSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKA 120 

IIIIIIIIIIMIIIIIIIIIIIIIIIIIIMIIIIIIIIIIMIIIIIIIIIIIIIIII 

Db 61 DGVEVHNAKTKPREEQYNSTYRWSVXiTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKA 120 

QY 121 KGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLD 18 0 

IN Ml II MINI II I MUM Ml III II MINI II Ml MINI II III II Mill 

Db 121 KGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLD 180 

Qy 181 SDGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 228 

IMIMMIIIMIIIMIIIIMIIIIIIIIIIIIIIMIIIIIIII 

Db 181 SDGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 228 



RESULT 3 
AAB98953 

ID AAB98953 standard; Protein; 228 AA. 
XX 

AC AAB98953; 
XX 

DT 14-AUG-2001 (first entry) 
XX 

DE Human IgGl Fc region. 



XX 

KW Human; IgGl; immunoglobulin; Fc region; Fc fusion protein; 

KW misfolding; therapy; cancer; osteoarthritis; AIDS; obesity; 

KW inflammation; transplant rejection. 
XX 

OS Homo sapiens. 
XX 

PN WO200134638-A1. 
XX 

PD 17-MAY-2001. 
XX 

PF 10-NOV-2000; 2000WO-US30798 . 
XX 

PR 12-NOV-1999; 99US- 0165188 . 

PR 09-NOV-2000; 2000US-0709704 . 
XX 

PA (AMGE-) AMGEN INC. 
XX 

PI Treuheit MJ, 0* Conner SR, Kosky AA; 
XX 

DR WPI; 2001-335908/35. 

DR N-PSDB; AAH25762 . 
XX 

PT Correcting disulfide bond misfolds in Fc-containing proteins, 

PT particularly therapeutic Fc-containing fusion proteins or antibodies, 

PT by treatment with copper halide 

XX 

PS Claim 30; Fig 5; 59pp; English. 
XX 

CC The present invention describes a process for preparing a 

CC pharmacologically active compound, involving preparing a compound 

CC comprising an immunoglobulin Fc domain fused to a protein of interest, 

CC treating the compound with a copper (II) halide and isolating the treated 

CC molecule. This can be used to correct misfolding of Fc domain containing 

CC proteins, for use in therapeutic agents which may be used in the 

CC treatment of cancer, inflammation, transplant rejection, AIDS, 

CC osteoarthritis and obesity. The present sequence is the IgGl Fc domain. 

XX 

SQ Sequence 228 AA; 



Query Match 100.0%; Score 1238; DB 22; Length 228; 

Best Local Similarity 100.0%; Pred. No. 1.2e-91; 

Matches 228; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 
Qy 1 MDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVWDVSHEDPEVKFNWYV 60 

IIIIIIMIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIII 

Db 1 MDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCWVT)VSHEDPEVKFNWYV 60 

Qy 61 DGVEVHNAKTKPREEQYNSTYRWSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKA 120 

IMIIIMIMIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIMIIIIIMIIIII 

Db 61 DGVEVHNAKTKPREEQYNSTYRWSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKA 120 

Qy 121 KGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLD 180 

MIIMIIIMIIIIIIIMIIIIIIIIIIIIIIIMIIIIIIMIIIIMMMIIIII 

Db 121 KGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLD 180 

Qy 181 SDGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 228 



IMIMIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIMIIIIIII 

Db 181 SDGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 228 



RESULT 4 
AAU81074 

ID AAU81074 standard; Protein; 228 AA. 
XX 

AC AAU81074; 
XX 

DT 09-APR-2002 (first entry) 
XX 

DE Human IgGl Fc. 
XX 

KW Human; IgG Fc; anticoagulant; thrombolytic; cytostatic; 

KW antiinflammatory; immunosuppressive; osteopathic; antagonist; 

KW laminin; saw-scaled viper; echistatin; integrin; selectin; vinculin; 

KW platelet aggregation; angiogenesis ; tumour; inflammation; 

KW autoimmune disease; rheumatoid arthritis; osteoporosis. 

XX 

OS Homo sapiens . 
XX 

PN WO200181377-A2 . 
XX 

PD 01-NOV-2001. 
XX 

PF 23-APR-2001; 2001WO-US13069 . 
XX 

PR 21-APR-2000; 2000US-198919P . 

PR 03-MAY-2000; 2000US-201394P . 
XX 

PA (AMGE-) AMGEN INC. 
XX 

PI Feige U, Kohno T, Lacey DL, Boone TC; 
XX 

DR WPI; 2002-062025/08. 

DR N-PSDB; ABK24097. 
XX 

PT Composition comprising integrin or adhesion antagonistic peptide and 

PT vehicle, useful for treating or preventing platelet aggregation, has a 

PT longer half-life than free peptide 
XX 

PS Claim 9; Fig 3; 68pp; English. 
XX 

CC The invention relates to a composition comprising an integrin/adhesion 

CC antagonistic peptide (I) and a vehicle e.g. IgG Fc. The peptides 

CC are based on laminin or saw-scaled viper echistatin and target integrin, 

CC selectin or vinculin. Also included are compounds of formula (la) and 

CC their multimers (X A l)_a-F A l- (X A 2)_b where; 

CC F A 1 = Fc domain; 

CC X A 1 and X A 2 = - (L A l)_c-P A l , (L A 1) _c-P A l- <L A 2) _d-P A 2 , 

CC (L A l)_c-P A l- (L A 2)_d-P A 2- (L A 3)_e-P A 3 or 

CC (L A l)_c-P A l- (L A 2)_d-P A 2- (L A 3)_e-P A 3- (L A 4 ) _f -P A 4 ; 

CC P A 1-P A 4 = same or different (I); 

CC L A 1-L A 4 = same or different linkers; 

CC a-f = 0 or 1, provided at least one of a and b = 1, 

CC a nucleic acid that encodes (la) , an expression vector containing the 



CC nucleic acid, host cells containing the vector, producing a 

CC pharmaceutical ly active compound (B) by covalently linking at least one 

CC Fc domain to at least one amino acid sequence of a selected randomized 

CC (I) and any of six laminin-related peptides (lb) . The compositions are 

CC used prophylactically and therapeutically in the same way as (I) , e.g. to 

CC inhibit platelet aggregation or angiogenesis (tumours) , or to treat 

CC inflammation and autoimmune diseases (e.g. rheumatoid arthritis) and many 

CC different forms of osteoporosis, also for diagnosis. Attaching the 

CC vehicle (especially Fc domain) to (I) increases the half-life (free (I) 

CC are normally degraded very quickly in vivo) . The present sequence 

CC is human IgGl Fc which is used as a vehicle for the antagonists of 

CC the invention. 

XX 

SQ Sequence 228 AA; 



Query Match 100.0%; Score 1238; DB 23; Length 228; 

Best Local Similarity 100.0%; Pred. No. 1.2e-91; 

Matches 228; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 
Qy 1 MDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVWDVSHEDPEVKFNWYV 60 

IIMIIIIIIIMIIIIIIIIMIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIMII 

Db 1 MDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCWVDVSHEDPEVKFNWYV 60 

Qy 61 DGVEVHNAKTKPREEQYNSTYRWSVLTVLHQDWLNGKEYKCKVSNKALPAPI EKTI SKA 12 0 

iiiiiiiiiiiiiiiimiiiiiiiiiiiiiiiiiiiiiiiiiiiiimiiiiiimiii 

Db 61 DGVEVHNAKTKPREEQYNSTYRWSVLTVLHQDWLNGKEYKCKVSNKALPAPI EKTI SKA 120 

Qy 121 KGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLD 18 0 

I I I I I I I I I I I II I I I I I II I I II I I I I I I M I I I I II I I I I I I I I I I I I I M I I I I I I I 
Db 121 KGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLD 18 0 

Qy 181 SDGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 228 

MIIIIIMIMIIIIIIIIMIIIIIIMIIIIMIIIMIIIIIII 

Db 181 SDGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 228 



RESULT 5 
ABB73410 

ID ABB73410 standard; Protein; 228 AA. 
XX 

AC ABB73410; 
XX 

DT 05-APR-2002 (first entry) 
XX 

DE Human immunoglobulin Gl Fc (IgGl Fc) amino acid SEQ ID NO:2. 
XX 

KW Modified peptide; mimetic; Fc domain; fusion; immunoglobulin G; IgG; 

KW EPO; erythropoietin; TPO; tumour necrosis factor alpha inhibitor; 

KW TNF-alpha inhibitor; interleukin 1 antagonist; IL-1 antagonist; TMP; 

KW TPO mimetic peptide; EPO mimetic peptide; EMP; VEGF antagonist; 

KW MMP inhibitor; antiinflammatory; antitumour; immunosuppressive; 

KW cytostatic; antirheumatic; antiarthritic; antidiabetic; ophthalmological ; 

KW antianaemic; anorectic; antiinf ertility; haemostatic; dermatological ; 

KW neuroprotective; inflammatory disease; autoimmune disease; tumour growth; 

KW cancer; rheumatoid arthritis; diabetic retinopathy; infertility; obesity; 

KW sleep disorder; neurological degenerative disease; anaemia; 

KW thrombocytopaenia; metastatic tumour; systemic lupus erythematosus; 



KW Fanconi 1 s syndrome . 
XX 

OS Homo sapiens . 
XX 

PN WO200183525-A2. 
XX 

PD 08-NOV-2001. 
XX 

PF 02-MAY-2001; 2001WO-US14310 . 
XX 

PR 03-MAY-2000; 2000US-0563286 . 
XX 

PA (AMGE-) AMGEN INC. 
XX 

PI Feige U, Liu C, Cheetham JC, Boone TC, Gudas JM; 
XX 

DR WPI; 2002-130313/17. 

DR N-PSDB; ABL35760. 
XX 

PT Novel vehicle-peptide molecule or its multimers useful for treating 

PT inflammatory and autoimmune diseases, cancer, rheumatoid arthritis, 

PT diabetic retinopathy, obesity, sleep disorders and infertility 
XX 

PS Claim 7; Fig 4; 176pp; English. 
XX 

CC The present invention describes a vehicle-peptide molecule (I) or its 

CC multimers. (I) can have antiinflammatory, antitumour, immunosuppressive, 

CC cytostatic, antirheumatic, antiarthritic , antidiabetic, ophthal mo logical , 

CC antianaemic, anorectic, antiinf ert ility, haemostatic, dermato logical and 

CC neuroprotective activities. (I) can be used as a therapeutic or 

CC prophylactic agent as well as for screening purposes. (I) is useful for 

CC diagnosing diseases characterised by dysfunction of their associated 

CC protein of interest, for identifying normal or abnormal proteins of 

CC interest, as a part of diagnostic kit to detect the presence of their 

CC proteins of interest in a biological sample. Additionally, (I) is useful 

CC for treating inflammatory and autoimmune diseases, tumour growth, cancer, 

CC rheumatoid arthritis, diabetic retinopathy, obesity, sleep disorders, 

CC infertility, and neurological degenerative diseases. (I), comprising 

CC EPO-mimetic compounds are useful for treating disorders characterised by 

CC low red blood cell levels such as anaemia. The TPO-mimetic comprising 

CC compounds are useful for treating conditions that involve an existing 

CC megakaryocyte/platelet deficiency or an expected megakaryocyte/platelet 

CC deficiency, such as thrombocytopaenia , aplastic anaemia, metastatic 

CC tumour which result in thrombocytopaenia, systemic lupus erythematosus, 

CC and Fanconi 1 s syndrome. ABB72403 to ABB73426 and ABL35695 to ABL35777 

CC represent amino acid and nucleic acid sequences used in the 

CC exemplification of the present invention. 

XX 

SQ Sequence 228 AA; 

Query Match 100.0%; Score 1238; DB 23; Length 228; 
Best Local Similarity 100.0%; Pred. No. 1.2e-91; 

Matches 228; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

1 MDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCV^VX)VSHEDPEVKFNWYV 60 

I I I I I I I I I M I I I I I I I I I : I I I I I I I I I I M I I I I I I I I I .1 M Ml I II , I i I I 
1 MDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEWCVVVTDVSHEDPEVKFNWYV 60 



Qy 

Db 



Qy 61 DGVEVHNAKTKPREEQYNSTYRWSVLTVLHQDWLNGKEYKCKVSNKALPAPI EKTI SKA 120 

IIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIII 

Db 61 DGVEVHNAKTKPREEQYNSTYRWSVLTVLHQDWLNGKEYKCKVSNKALPAPI EKTI SKA 120 

Qy 121 KGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLD 180 

II II I II II I MM M I Mill I Mill I Mill MINI 1 1 III Mill II M 1 1 

Db 121 KGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLD 180 

Qy 181 SDGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 228 

1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 i E 1 1 1 1 1 1 1 1 J 1 1 1 1 1 1 i 1 1 1 1 1 1 1 i I J 1 1 1 1 1 

Db 181 SDGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 228 



RESULT 6 
AAE14310 

ID AAE14310 standard; Protein; 228 AA. 
XX 

AC AAE14310; 
XX 

DT 07-MAR-2002 (first entry) 
XX 

DE Human immunoglobulin G (IgGl) Fc. 
XX 

KW Human; calcitonin; CT; CT receptor; Fc domain; therapy; osteoporosis ; 

KW immunoglobulin G; IgG; osteopathic. 

XX 

OS Homo sapiens . 
XX 

PN WO200183526-A2 . 
XX 

PD 08-NOV-2001. 
XX 

PF 03-MAY-2001; 2001WO-US14320 . 
XX 

PR 03-MAY-2000; 2000US-201511P . 
PR 02-MAY-2001; 2001US-0847712 . 
XX 

PA (AMGE-) AMGEN INC. 
XX 

PI Liu C, Marshall WS, Reynolds A; 
XX 

DR WPI; 2002-034503/04. 
DR N-PSDB; AAD23840 . 
XX 

PT Compositions comprising Calcitonin receptor modulator domains, useful 

PT for treating osteoporosis - 

XX 

PS Claim 8; Fig 3; 64pp; English. 
XX 

CC The invention relates to therapeutic agents that modulate the 
CC activity of calcitonin (CT) receptor. Modulators of CT receptor 
CC comprise a CT receptor modulating domain and a vehicle such as a 
CC polymer or an Fc domain, where the vehicle is covalently attached 
CC to the CT receptor modulating domain. The compositions comprising 
CC CT receptor modulating domains are used to treat osteoporosis. 
CC The present sequence is human immunoglobulin G (IgGl) Fc protein 



CC used in the invention. 
XX 

SQ Sequence 228 AA; 



Query Match 100.0%; Score 1238; DB 23; Length 228; 

Best Local Similarity 100.0%; Pred. No. 1.2e-91; 
Matches 228; Conservative 0; Mismatches 0; Indels 



Qy 

Db 

Qy 

Db 

Qy 
Db 

Qy 
Db 



0 ; Gaps 0 ; 

1 MDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMI SRTPEVTCVWDVSHEDPEVKFNWYV 60 

I I II MINI II III 1 1 III I Mill MM: I III I Mlllll Ml 1 1 II I II I III 

1 MDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEWCVVVDVSHEDPEVKFNWYV 60 



61 



120 



DGVEVHNAKTKPREEQYNSTYRWSVLTVLHQDWLNGKEYKCKVSNKALPAPI EKTI SKA 

IIMIMIIIIIIIMIIMIIMIIIIMIIIIIIIMIIIIIIIIIIIIMIMIIII 

61 DGVEVHNAKTKPREEQYNSTYRWSVLTVLHQDWLNGKEYKCKVSNKALPAPI EKTI SKA 12 0 
121 KGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLD 180 

II MMMIMMIMM MMMMIMM IIIIIIIIIIIIIIMIIIIIMMI 

121 KGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLD 180 



181 



228 



SDGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 

II MMMMMIMI MIMMMIMM MMMIMIMM 

181 SDGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 228 



RESULT 7 
AAU73018 
ID 
XX 
AC 
XX 
DT 
XX 
DE 
XX 
KW 
KW 
KW 
KW 
KW 
KW 
KW 
KW 
KW 
XX 
OS 
XX 
PN 
XX 
PD 
XX 
PF 
XX 
PR 
PR 
PR 
PR 



AAU73018 standard; Protein; 228 AA. 
AAU73018; 

12-MAR-2002 (first entry) 
Human immunoglobulin G (IgG) Fc region. 

Human; parathyroid hormone; PTH; parathyroid hormone -related protein; 
PTHrP; bone resorption inhibitor; osteoprotegrin; OPG; OPG-L antibody; 
calcitonin; bisphosphonate ; oestrogen; oestrogen receptor; tibolone; 
osteopenia; hyperthyroidism; hypercalcaemia; tumour metastasis; bone; 
breast cancer; prostate cancer; cachexia; anorexia; osteoporosis; 
Paget 's disease; osteomyelitis; osteonecrosis; bone cell death; 
Gaucher's disease; sickle cell anaemia; systemic lupus erythematosus; 
rheumatoid arthritis; periodontal disease; alopecia; fracture repair,- 
immunoglobulin G; IgG. 

Homo sapiens . 

WO200181415-A2. 

01-NOV-2001. 

27-APR-2001; 2001WO-US13528 . 



27- APR-2000; 

28- JUN-2000; 
06-FEB-2001; 
26-APR-2001; 



2000US-200053P. 
2000US-214860P. 
2001US-266673P. 
2001US-0843221 . 



XX 

PA (AMGE-) AMGEN INC. 
XX 

PI Kostenuik P, Liu C, Lacey DL; 
XX 

DR WPI; 2002-066435/09. 

DR N-PSDB; AAS97392. 
XX 

PT Composition, useful for treating osteopenia, comprises parathyroid 

PT hormone and parathyroid hormone-related protein receptor modulators - 
XX 

PS Claim 6; Figure 3; 107pp ; English. 
XX 

CC The invention relates to a composition (I) comprising modulators of 

CC parathyroid hormone (PTH) and parathyroid hormone -related protein (PTHrP) 

CC which comprise a PTH/ PTHrP modulating domain and a vehicle. (I) 

CC comprising PTH agonist optionally with a bone resorption inhibitor, such 

CC as osteoprotegrin (OPG) , OPG-L antibody, calcitonin, bisphosphonates, 

CC oestrogens, oestrogen receptor modulators and tibolone is useful for 

CC treating osteopenia., (I) is useful for therapeutic and prophylactic 

CC purposes. Antagonists of PTH receptor are useful in treating primary and 

CC secondary hyperthyroidism, hypercalcaemia, tumour metastases, 

CC particularly breast and prostate cancer, cachexia and anorexia, 

CC osteopenia, including various forms of osteoporosis, Paget ! s disease of 

CC bone, osteomyelitis, osteonecrosis or bone cell death, associated with 

CC traumatic injury or nontraumatic necrosis associated with Gaucher' s 

CC disease, sickle cell anaemia, systemic lupus erythematosus, rheumatoid 

CC arthritis, periodontal disease and alopecia. PTH receptor agonists are 

CC useful as therapeutic agents in conditions including fracture repair 

CC (including healing of non-union fractures), osteopenia, including various 

CC forms of osteoporosis. AAU73018-AAU73181 represent parathyroid hormone 

CC and parathyroid hormone related protein ( PTH/ PTHrP) modulators and 

CC related amino acid sequences of the invention. 

XX 

SQ Sequence 228 AA; 

Query Match 100,0%; Score 1238; DB 23; Length 228; 

Best Local Similarity 100.0%; Pred. No. 1.2e-91; 

Matches 228; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 
Qy 1 MDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVWDVSHEDPEVKFNWYV 60 

Mill IM , M M II llllill 1 1 1 Mill I Mill I IIIIMI Mill III III 

Db 1 MDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFNWYV 60 

Qy 61 DGVEVHNAKTKPREEQYNSTYRWSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKA 120 

U I M II ' M iM I MM, MM M.MI IM M 

Db 61 DGVE VHNAKTKPREEQYNSTYRWSVLTVLHQDWLNGKEYKCKVSNKALPAPI EKTI SKA 120 

Qy 121 KGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLD 18 0 

I M I II III Mill 1 1 M M 1 1 III II MMII 1 1 III I III III II Mill I M III 

Db 121 KGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLD 180 

Qy 181 SDGS FFL YS KLT VDKSRWQQGNVFS CS VMHEALHNH YTQKSLSLS PGK 228 

' i M MUM 'r M 11 , M Ml ' II I 

Db 181 SDGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 228 



RESULT 8 
AAG66012 

ID AAG66012 standard; Protein; 228 AA. 
XX 

AC AAG66012; 
XX 

DT 27-FEB-2002 (first entry) 
XX 

DE Human immunoglobulin (Ig) Gl Fc region sequence. 
XX 

KW Apo-AI; amphipathic; pharmaceutical; peptide mimic; antilipemic; 

KW anti-HIV; virucide; immunoglobulin; IgGl . 

XX 

OS Homo sapiens. 
XX 

PN WO200181376-A2 . 
XX 

PD 01-NOV-2001. 
XX 

PF 23-APR-2001; 2 001WO-US13 068 . 
XX 

PR 21-APR-2000; 2000US-198920P . 
XX 

PA (AMGE-) AMGEN INC . 
XX 

PI Kohno T; 
XX 

DR WPI; 2002-049262/06. 

DR N-PSDB; AAI67658. 
XX 

PT Recombinant or modified therapeutic agents having Apo-AI amphipathic 

PT helix peptide activity useful in treatment of hypercholesterolemia and 

PT viral infections such as herpes simplex virus, human immunodeficiency 

PT virus 
XX 

PS Claim 8; Fig 3A-B; 49pp; English. 
XX 

CC The invention provides a composition comprising a therapeutic agent that 

CC has activity similar to Apo-AI amphipathic helix peptide, but with better 

CC pharmaceutical characteristics attached to a vehicle through the 

CC peptide's N-terminus or C-terminus having a specified formula. The 

CC peptide mimic has greater half-life compared to conventional Apo-AI 

CC amphipathic helix peptide. The compositions are useful for treating 

CC hypercholesterolemia and viral infection such as HIV, HSV. The present 

CC sequence represents the human immunoglobulin (Ig) Gl Fc region which acts 

CC as a vehicle. 

XX 

SQ Sequence 228 AA; 

Query Match 100.0%; Score 1238; DB 23; Length 228; 

Best Local Similarity 100.0%; Pred. No. 1.2e-91; 

Matches 228; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 MDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMI SRTPEVTCVWDVSHEDPEVKFNWYV 60 

I I M I I I I I I I I I I I I I I I M I I II I II I I I I I I I I I II I I I I I I I I ' I I I I I ! I I I I 
Db 1 MDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFNWYV 60 



Qy 61 DGVEVHNAKTKPREEQYNSTYRWSVLTVLHQDWLNGKEYKCKVSNKAXiPAPI EKTI SKA 120 

IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIII 
Db 61 DGVEVHNAKTKPREEQYNSTYRWSVLTVLHQDWLNGKEYKCKVSNKALPAPI EKTI SKA 120 

Qy 121 KGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLD 180 

MIIIIIIIIIIIIIIIMIIIIMIIIIIIIIIIIIIIIIIMIIIIIIIIIIMIMI 

Db 121 KGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLD 180 

Qy 181 SDGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 228 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 : 1 1 II 1 1! 1 1 

Db 181 SDGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 228 



RESULT 9 
ABB04279 

ID ABB04279 standard; Protein; 228 AA. 
XX 

AC ABB04279; 
XX 

DT 13-FEB-2002 (first entry) 
XX 

DE Human IgGl Fc domain. 
XX 

KW Glucagon antagonist; antidiabetic; anti-hormonal ; Fc domain; 

KW non-insulin dependent diabetes mellitus; human; immunoglobulin G; IgG. 

XX 

OS Homo sapiens. 
XX 

PN WO200183527-A2. 
XX 

PD 08-NOV-2 001. 
XX 

PF 03-MAY-2001; 2001WO-US1432 1 . 
XX 

PR 03-MAY-2000; 2000US-2 01436P . 
PR 02-MAY-2001; 2001US-084724 9 . 
XX 

PA (AMGE-) AMGEN INC. 
XX 

PI Marshall WS, Stark KL; 
XX 

DR WPI; 2002-017738/02. 
DR N-PSDB; ABA03672. 
XX 

PT Compositions comprising glucagon antagonist domains, useful for 

PT treating diabetes mellitus - 

XX 

PS Claim 8; Fig 2; 54pp; English. 
XX 

CC The invention relates to compositions comprising a glucagon antagonist 
CC domain and a vehicle, such as a polymer (e.g. PEG or dextran) or, 
CC preferably, an Fc domain. The vehicle is covalently attached to the 
CC glucagon antagonist domain. The compositions are administered to 
CC treat non-insulin dependent diabetes mellitus. The present sequence 
CC is the human IgG Fc domain, which may be used as the vehicle 
CC in the compositions of the invention. 
XX 



SQ Sequence . 228 AA; 



Query Match 100.0%; Score 1238; DB 23; Length 228; 

Best Local Similarity 100.0%; Pred. No. 1.2e-91; 

Matches 228; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 
Qy 1 MDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCWVDVSHEDPEVKPNWYV 60 

1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 II 1 1 1 1 II 1 1 1 1 1 1 1! 1 1 1 1 1 1 1 1 1 1 1 

Db 1 MDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKPNWYV 60 

Qy 61 DGVEVHNAKTKPREEQYNSTYRWSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKA 12 0 

II MM 1 1 MINI Mill I MM I Mill I III Ml II II I Mill II MINI III II 

Db 61 DGVEVHNAKTKPREEQYNSTYRWS VLTVXjHQDWLNGKEYKCKVSNKALPAPI EKTI SKA 120 

Qy 121 KGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLD 18 0 

1 1 II 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II II 

Db 121 KGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLD 18 0 

Qy 181 SDGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 228 

IMIIIIIMMMMIMIIMIIIIIIIIIIIIIIIIIIIIIIIM 

Db 181 SDGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 228 



RESULT 10 
ABJ38267 

ID ABJ38267 standard; Protein; 228 AA. 
XX 

AC ABJ38267; 
XX 

DT 12-JUN-2003 (first entry) 
XX 

DE Human IgGl Fc protein SEQ ID No 2 . 
XX 

KW TALL- 1 -binding protein; TALL-1; B-cell -mediated autoimmune disease; 

KW systemic lupus erythematosus; B-cell-mediated cancer; lymphoma ; 

KW inflammation; rheumatoid arthritis; acute pancreatitis; atherosclerosis; 

KW Alzheimer's disease; asthma; cachexia; cirrhosis; diabetes; osteoporosis; 

KW glomerulonephritis; Hashimoto's thyroiditis; ischaemic injury; psoriasis; 

KW multiple myeloma; multiple sclerosis; Parkinson's disease; vasculitis; 

KW gene therapy; human IgGlFc; human. 

XX 

OS Homo sapiens. 
XX 

PN WO200292620-A2. 
XX 

PD 21-NOV-2 002. 
XX 

PF 13-MAY-2002; 2002WO-US15273 . 
XX 

PR ll-MAY-2001; 2001US-290196P . 
XX 

PA (AMGE-) AMGEN INC. 
XX 

PI Min H, Hsu H; 
XX 

DR WPI; 2003-156719/15. 

DR N-PSDB; ABT33856. 



XX 

PT New TALL- 1 -binding polypeptide, useful for modulating the activity of 

PT TALL-1 and in treating, preventing or diagnosing a B-cell -mediated 

PT autoimmune diseases, cancers or lymphomas - 
XX 

PS Claim 36; Fig 3; 236pp; English. 
XX 

CC The invention relates to a novel TALL-1 -binding polypeptide comprising a 

CC defined sequence in the specification. The composition is useful in 

CC modulating the activity of TALL-1, and in treating, preventing, 

CC ameliorating, diagnosing or prognosing a B-cell-mediated autoimmune 

CC disease (e.g. systemic lupus erythematosus) or B-cell-mediated cancer or 

CC lymphoma. The composition may also be used in treating inflammations 

CC (e.g. rheumatoid arthritis), acute pancreatitis, Alzheimer's disease, 

CC asthma, atherosclerosis, cachexia, cirrhosis, diabetes, 

CC glomerulonephritis, Hashimoto's thyroiditis, ischaemic injury, multiple 

CC myeloma, multiple sclerosis, osteoporosis, Parkinson's disease, psoriasis 

CC and vasculitis. Disorders may be treated with the novel composition using 

CC gene therapy. This sequence represents a human IgGlFc protein relating to 

CC the TALL-1 sequence of the invention. 
XX 

SQ Sequence 228 AA; 



Query Match 100.0%; Score 1238; DB 24; Length 228; 

Best Local Similarity 100.0%; Pred. No. 1.2e-91; 

Matches 228; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 
Qy 1 MDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVWDVSHEDPEVKFNWYV 60 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 

Db 1 MDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVWDVSHEDPEVKFNWYV 60 

Qy 61 DGVEVHNAKTKPREEQYNSTYRWSVLTVLHQDWLNGKEYKCKVSNKALPAPI EKTI SKA 120 

1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 II 1 1 M 1 1 1 1 1 1 1 1 1 II M 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 

Db 61 DGVEVHNAKTKPREEQYNSTYRWSVLTVLHQDWLNGKEYKCKVSNKALPAPI EKTI SKA 120 

Qy 121 KGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLD 180 

1 1 1 - 1 1 I i 1 ' . ! i I ' I ! ! . I i I . ; ' I I I I 1 1 1 I ; I , . 

Db 121 KGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLD 180 

Qy 181 SDGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 228 

IIIIIMIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIII 

Db 181 SDGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 228 



RESULT 11 
AAB17957 

ID AAB17957 standard; Protein; 243 AA. 
XX 

AC AAB17957; 
XX 

DT 31-OCT-2000 (first entry) 
XX 

DE Fc-MMP inhibitor fusion protein sequence SEQ ID NO: 1068. 
XX 

KW Modified peptide; therapeutic agent; fusion; Fc domain; cancer; 

KW autoimmune disease; cytostatic; antiasthmatic; thrombolytic; VEGF; 

KW immunosuppressive; EPO; TPO; CTLA4 ; mimetic; IL-1; TNF; antagonist; 



KW MMP; inhibitor; erythropoietin; thrombopoietin; interleukin 1; 

KW cytotoxic T cell lymphocyte antigen 4; tumour necrosis factor; 

KW vascular endothel ial growth factor; matrix metal loproteinase; 

KW asthma; thrombosis; pharmaceutical. 
XX 

OS Synthetic. 
XX 

PN WO200024782-A2 . 
XX 

PD 04-MAY-2000. 
XX 

PF 25-OCT-1999; 99WO-US25044 . 
XX 

PR 23-OCT-1998; 98US-0105371 . 

PR 22-OCT-1999; 99US-0428082 . 
XX 

PA (AMGE-) AMGEN INC. 
XX 

PI Feige U, Liu C, Cheetham J, Boone TC; 
XX 

DR WPI; 2000-350702/30. 

DR N-PSDB; AAA69507. 
XX 

PT Novel composition of matter comprising an Fc domain and 

PT pharmacologically active peptides, useful for treating cancer and 

PT autoimmune diseases - 

XX 

PS Example 7; Page 585-586; 608pp; English. 
XX 

CC The present invention describes composition of matter (I) comprising an 

CC Fc domain, pharmacologically active peptides, and linkers. Where (I) is: 

CC (Xl)a-Fl- (X2)b, where: Fl = an Fc domain; XI and X2 = are each 

CC independently selected from -(Ll)c-Pl, - (Ll)'c-Pl- (L2) d-P2, 

CC - (Ll)c-Pl- (L2)d-P2- (L3)e-P^3, or - (LI) c-Pl- (L2) d-P2- (L3 ) e-P3 - (L4) f -P4 

CC where PI, P2 , P3 , and P4 = are each independently sequences of 

CC pharmacologically active peptides; LI, L2, L3 , and L4 = are' each 

CC independently linkers; and a, b, c, d, e, and f = are each independently 

CC 0 or 1, provided that at least 1 of a and b is 1. The composition can 

CC have cytostatic, antiasthmatic, thrombolytic and immunosuppressive 

CC activities. DNAs , vectors and host cells from the present invention can 

CC be used for producing pharmaceutical compositions. The compositions are 

CC useful for treating cancer, asthma, thrombosis, or autoimmune diseases. 

CC The use of an Fc domain (rather than a Fab domain) can provide a longer 

CC half-life or incorporate functions such as Fc receptor binding, protein 

CC A binding, complement fixation, and possibly placental transfer. AAA69443 

CC to AAA69526 and AAB16955 to AAB18003 represent nucleotide and amino acid 

CC sequences used in the exemplification of the present invention. 

XX 

SQ Sequence 243 AA; 



Query Match 100.0%; Score 1238; DB 21; Length 243; 

Best Local Similarity 100.0%; Pred. No. 1.3e-91; 

Matches 228; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 
Qy 1 MDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTP 60 

IIIMIMIIIIIIIIIIIIIIIMIIIIIMIIIIMIIIIIIIIIIIIIIIIIIIIII 

Db 1 MDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCWVDVSHEDPEVKFNWYV 60 



Qy 61 DGVEVHNAKTKPREEQYNSTYRWSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKA 120 

IIIIMIIIIIIIIIIIIIIIMIIMIIIIIIIIIIIIMIIIIIIIIIIIIIMIIII 

Db 61 DGVEVHNAKTKPREEQYNSTYRWSVLTVLHQDWLNGKEYKCKVSNKAXiPAPI EKTI SKA 120 

Qy 121 KGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLD 18 0 

iiiiimiimiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiimiiiiiiiiiimiii 

Db 121 KGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLD 180 

Qy 181 SDGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 228 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I II I I I I I I I 
Db 181 S DGS F FL YS KLT VD KS RWQQGN VF S CS VMHEALHNH YTQKS LS LS PGK 228 

RESULT 12 
ABB73425 

ID ABB73425 standard; Protein; 243 AA. 
XX 

AC ABB73425; 
XX 

DT 05-APR-2002 (first entry) 
XX 

DE Fc-MMP inhibitor fusion nucleic acid SEQ ID NO: 1067. 
XX 

KW Modified peptide; mimetic; Fc domain; fusion; immunoglobulin G; IgG; 

KW EPO; erythropoietin; TPO; tumour necrosis factor alpha inhibitor; 

KW TNF-alpha inhibitor; interleukin 1 antagonist; IL-1 antagonist; TMP; 

KW TPO mimetic peptide; EPO mimetic peptide; EMP; VEGF antagonist; 

KW MMP inhibitor; antiinflammatory; antitumour; immunosuppressive; 

KW cytostatic; antirheumatic; antiarthritic ; antidiabetic; ophthalmological ; 

KW antianaemic; anorectic; antiinf ertility; haemostatic; dermatological ; 

KW neuroprotective; inflammatory disease; autoimmune disease; tumour growth; 

KW cancer; rheumatoid arthritis; diabetic retinopathy; infertility; obesity; 

KW sleep disorder; neurological degenerative disease; anaemia; 

KW thrombocytopaenia; metastatic tumour; systemic lupus erythematosus ; 

KW Fanconi ' s syndrome . 

XX 

OS Homo sapiens . 

OS Synthetic. 
XX 

PN WO200183525-A2. 
XX 

PD 08-NOV-2001. 
XX 

PF 02-MAY-2001; 2001WO-US14310 . 
XX 

PR 03-MAY-2000; 2000US-0563286 . 
XX 

PA (AMGE-) AMGEN INC. 
XX 

PI Feige U, Liu C, Cheetham JC, Boone TC, Gudas JM; 
XX 

DR WPI; 2002-130313/17. 

DR N-PSDB; ABL35775. 
XX 

PT Novel vehicle-peptide molecule or its multimers useful for treating 

PT inflammatory and autoimmune diseases, cancer, rheumatoid arthritis, 



PT diabetic retinopathy, obesity, sleep disorders and infertility - 
XX 

PS Example 7; Fig 25A-B; 176pp; English. 
XX 

CC The present invention describes a vehicle-peptide molecule (I) or its 

CC multimers. (I) can have antiinflammatory, antitumour, immunosuppressive, 

CC cytostatic, antirheumatic, antiarthritic, antidiabetic, ophthalmological , 

CC antianaemic, anorectic, antiinf ertility, haemostatic, dermatological and 

CC neuroprotective activities. (I) can be used as a therapeutic or 

CC prophylactic agent as well as for screening purposes. (I) is useful for 

CC diagnosing diseases characterised by dysfunction of their associated 

CC protein of interest, for identifying normal or abnormal proteins of 

CC interest, as a part of diagnostic kit to detect the presence of their 

CC proteins of interest in a biological sample. Additionally, (I) is useful 

CC for treating inflammatory and autoimmune diseases, tumour growth, cancer, 

CC rheumatoid arthritis, diabetic retinopathy, obesity, sleep disorders, 

CC infertility, and neurological degenerative diseases. (I), comprising 

CC EPO-mimetic compounds are useful for treating disorders characterised by 

CC low red blood cell levels such as anaemia. The TPO-mimetic comprising 

CC compounds are useful for treating conditions that involve an existing 

CC megakaryocyte/platelet deficiency or an expected megakaryocyte/platelet 

CC deficiency, such as thrombocytopaenia, aplastic anaemia, metastatic 

CC tumour which result in thrombocytopaenia, systemic lupus erythematosus, 

CC and Fanconi ' s syndrome. ABB72403 to ABB73426 and ABL35695 to ABL35777 

CC represent amino acid and nucleic acid sequences used in the 

CC exemplification of the present invention. 

XX 

SQ Sequence 243 AA; 



Query Match 100.0%; Score 1238; DB 23; Length 243; 

Best Local Similarity 100.0%; Pred. No. 1.3e-91; 

Matches 228; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 
Qy 1 MDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEvXFNWYV 60 

IIIIMIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIMIIIIIIIIIIIIII 

Db 1 MDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVWDVSHEDPEVKFNWYV 60 

Qy 61 DGVEVHNAKTKPREEQYNSTYRWSVLTVLHQDWLNGKEYKCKVSNKALPAPI EKTI SKA 12 0 

IIIIMIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIMMIIII 

Db 61 DGVE VHNAKTKPREEQYNSTYRWSVLTVLHQDWLNGKEYKCKVSNKALPAPI EKTI SKA 120 



Qy 121 KGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLD 18 0 

IIIIIIIIMIIIIIIIMMIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIMM 

Db 121 KGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLD 18 0 

Qy 181 SDGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 228 

IIIIIIIMIIIIIIMIIIIIIIIIIIIMIIIIMIIIIIMIIII 

Db 181 SDGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHMHYTQKSLSLSPGK 228 



RESULT 13 
AAB16958 

ID AAB16958 standard; Protein; 247 AA. 
XX 

AC AAB16958; 
XX 

DT 31-OCT-2000 (first entry) 



XX 

DE Fc-TMP protein sequence SEQ ID NO: 6. 
XX 

KW Modified peptide; therapeutic agent; fusion; Fc domain; cancer; 

KW autoimmune disease; cytostatic; antiasthmatic; thrombolytic; VEGF; 

KW immunosuppressive; EPO; TPO; CTLA4 ; mimetic; IL-1; TNF; antagonist; 

KW MMP; inhibitor; erythropoietin; thrombopoietin; interleukin 1; 

KW cytotoxic T cell lymphocyte antigen 4; tumour necrosis factor; 

KW vascular endothelial growth factor; matrix metalloproteinase; 

KW asthma; thrombosis; pharmaceutical. 
XX 

OS Homo sapiens . 

OS Synthetic. 
XX 

PN WO200024782-A2 . 
XX 

PD 04-MAY-2000. 
XX 

PF 25-OCT-1999; 99WO-US25044 . 
XX 

PR 23-OCT-1998; 98US- 0105371 . 

PR 22-OCT-1999; 99US- 0428 082 . 
XX 

PA (AMGE-) AMGEN INC. 
XX 

PI Feige U, Liu C, Cheetham J, Boone TC; 
XX 

DR WPI; 2000-350702/30. 

DR N-PSDB; AAA69444. 
XX 

PT Novel composition of matter comprising an Fc domain and 

PT pharmacologically active peptides, useful for treating cancer and 

PT autoimmune diseases - 

XX 

PS Claim 21; Page 179-180; 608pp ; English. 
XX 

CC The present invention describes composition of matter (I) comprising an 

CC Fc domain, pharmacologically active peptides, and linkers. Where (I) is: 

CC (Xl)a-Fl- (X2)b, where: Fl = an Fc domain; XI and X2 = are each 

CC independently selected from -(LI) c-Pl, - (LI) c-Pl- (L2) d-P2, 

CC - (Ll)c-Pl-(L2)d-P2-(L3)e-P A 3, or - (LI) c-Pl- (L2 ) d-P2- (L3) e-P3- (L4) f -P4 

CC where PI, P2, P3, and P4 - are each independently sequences of 

CC pharmacologically active peptides; LI, L2, L3 , and L4 = are each 

CC independently linkers; and a, b, c, d, e, and f = are each independently 

CC 0 or 1, provided that at least 1 of a and b is 1. The composition can 

CC have cytostatic, antiasthmatic, thrombolytic and immunosuppressive 

CC activities. DNAs , vectors and host cells from the present invention can 

CC be used for producing pharmaceutical compositions. The compositions are 

CC useful for treating cancer, asthma, thrombosis, or autoimmune diseases. 

CC The use of an Fc domain (rather than a Fab domain) can provide a longer 

CC half-life or incorporate functions such as Fc receptor binding, protein 

CC A binding, complement fixation, and possibly placental transfer. AAA69443 

CC to AAA69526 and AAB16955 to AAB18 003 represent nucleotide and amino acid 

CC sequences used in the exemplification of the present invention. 

XX 

SQ Sequence 247 AA; 



Query Match 100.0%; Score 1238; DB 21; Length 247; 

Best Local Similarity 100.0%; Pred. No. 1.3e-91; 

Matches 228; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 



Qy 1 MDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMI SRTPEVTCVWDVSHEDPEVKFNWYV 60 

IIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIMIIIIIII 

Db 1 MDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFNWYV 60 

Qy 61 DGVEVHNAKTKPREEQYNSTYRWS VLTVLHQDWLNGKEYKCKVSNKALPAP I EKTI SKA 120 

IIIIMIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIMIIMIIIIIMI 

Db 61 DGVEVHNAKTKPREEQYNSTYRWSVLTVLHQDWLNGKEYKCKVSNKALPAPI EKTI SKA 12 0 

Qy 121 KGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLD 180 

IIIIIIIIMIIIIIMIIIIIIMIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIII 

Db 121 KGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLD 18 0 

Qy 181 SDGS FFLYS KLTVDKSRWQQGNVFS CS VMHEALHNH YTQKSLSLS PGK 228 

I I I I I I I I I I I I I I I I I I [ I I I I I I I I II I I I I I ! I I II I I I I I 

Db 181 SDGSFFLYSKLTVDKSRWQQGNVFSCS VMHEALHNH YTQKSLSLS PGK 228 



RESULT 14 
ABB73411 

ID ABB73411 standard; Protein; 247 AA. 
XX 

AC ABB73411; 
XX 

DT 05-APR-2002 (first entry) 
XX 

DE Fc-TPO mimetic peptide (Fc-TMP) amino acid SEQ ID NO: 6. 
XX 

KW Modified peptide; mimetic; Fc domain; fusion; immunoglobulin G; IgG; 

KW EPO; erythropoietin; TPO; tumour necrosis factor alpha inhibitor; 

KW TNF-alpha inhibitor; interleukin 1 antagonist; IL-1 antagonist; TMP; 

KW TPO mimetic peptide; EPO mimetic peptide; EMP; VEGF antagonist; 

KW MMP inhibitor; antiinflammatory; antitumour; immunosuppressive; 

KW cytostatic; antirheumatic; antiarthritic; antidiabetic; ophthalmological ; 

KW antianaemic; anorectic; antiinf ertility; haemostatic; dermatological ; 

KW neuroprotective; inflammatory disease; autoimmune disease; tumour growth; 

KW cancer; rheumatoid arthritis; diabetic retinopathy; infertility; obesity; 

KW sleep disorder; neurological degenerative disease; anaemia; 

KW thrombocytopaenia ; metastatic tumour; systemic lupus erythematosus; 

KW Fanconi ' s syndrome . 

XX 

OS Homo sapiens. 

OS Synthetic. 
XX 

PN WO200183525-A2. 
XX 

PD 08-NOV-2001. 
XX 

PF 02-MAY-2001; 2001WO-US14310 . 
XX 

PR 03-MAY-2000; 2000US-0563286 . 
XX 

PA (AMGE-) AMGEN INC. 
XX 



Feige U, Liu C, Cheetham JC, Boone TC, Gudas JM; 

WPI; 2002-130313/17. 
N-PSDB; ABL35761. 

Novel vehicle-peptide molecule or its multimers useful for treating 
inflammatory and autoimmune diseases, cancer, rheumatoid arthritis, 
diabetic retinopathy, obesity, sleep disorders and infertility - 

Claim 21; Fig 7; 176pp ; English. 

The present invention describes a vehicle-peptide molecule (I) or its 
multimers. (I) can have antiinflammatory, antitumour, immunosuppressive, 
cytostatic, antirheumatic, antiarthritic, antidiabetic, ophthalmological , 
antianaemic, anorectic, antiinf ertility, haemostatic, dermatological and 
neuroprotective activities. (I) can be used as a therapeutic or 
prophylactic agent as well as for screening purposes. (I) is useful for 
diagnosing diseases characterised by dysfunction of their associated 
protein of interest, for identifying normal or abnormal proteins of 
interest, as a part of diagnostic kit to detect the presence of their 
proteins of interest in a biological sample. Additionally, (I) is useful 
for treating inflammatory and autoimmune diseases, tumour growth, cancer, 
rheumatoid arthritis, diabetic retinopathy, obesity, sleep disorders, 
infertility, and neurological degenerative diseases, (r) , comprising 
EPO-mimetic compounds are useful for treating disorders characterised by 
low red blood cell levels such as anaemia. The TPO-mimetic comprising 
compounds are useful for treating conditions that involve an existing 
megakaryocyte/platelet deficiency or an expected megakaryocyte/platelet 
deficiency, such as thrombocytopaenia, aplastic anaemia, metastatic 
tumour which result in thrombocytopaenia, systemic lupus erythematosus, 
and Fanconi's syndrome. ABB72403 to ABB73426 and ABL35695 to ABL35777 
represent amino acid and nucleic acid sequences used in the 
exemplification of the present invention. 

Sequence 247 AA; 



Query Match 100.0%; Score 1238; DB 23; Length 247; 

Best Local Similarity 100.0%; Pred. No. 1.3e-91; 

Matches 228; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 



Qy 


l 


MDKTHTCPPCPAPELIX3GPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFNWYV 

MMIMIIIIIIIIMIIIMIIIIIIIIMIIMIIIIIIIIIIIIMIIIIIIIIM 

MDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVWDVSHEDPEVKFNWYV 


60 


Db 


l 


60 


Qy 


61 


DGVEVHNAKTKPREEQYNSTYRWSVLTVLHQDWLNGKEYKCKVSNKALPAPI EKTI SKA 

MIMIIIMIMIIIIIIIIIIIIIIIIIIIIIMMIIIIIIIIIIIIIMIIIIIII 

DGVEVHNAKTKPREEQYNSTYRWSVLTVLHQDWLNGKEYKCKVSNKALPAPI EKTI SKA 


120 


Db 


61 


120 


Qy 


121 


KGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLD 

IMIIIIIIIIMIMIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIMIMIIIIIIM 

KGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLD 


180 


Db 


121 


180 


Qy 


181 


SDGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 228 

MIIMIIMIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIII 

SDGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 228 




Db 


181 





PI 

XX 
DR 
DR 
XX 
PT 
PT 
PT 
XX 
PS 
XX 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
XX 
SQ 



RESULT 15 
AAB17951 

ID AAB17951 standard; Protein; 248 AA. 
XX 

AC AAB17951; 
XX 

DT 31-OCT-2000 (first entry) 
XX 

DE Fc-TNF -alpha inhibitor fusion protein sequence SEQ ID NO: 1056. 
XX 

KW Modified peptide; therapeutic agent; fusion; Fc domain; cancer; 

KW autoimmune disease; cytostatic; antiasthmatic; thrombolytic; VEGF; 

KW immunosuppressive; EPO; TPO; CTLA4; mimetic; IL-1; TNF; antagonist ; 

KW MMP; inhibitor; erythropoietin; thrombopoietin; interleukin 1; 

KW cytotoxic T cell lymphocyte antigen 4; tumour necrosis factor ; 

KW vascular endothelial growth factor; matrix metal loproteinase; 

KW asthma; thrombosis; pharmaceutical. 
XX 

OS Synthetic. 
XX 

PN WO200024782-A2 . 
XX 

PD 04-MAY-2000. 
XX 

PF 25-OCT-1999; 99WO-US25044 . 
XX 

PR 23-OCT-1998; 98US- 0105371 . 

PR 22-OCT-1999; 99US- 0428082 . 
XX 

PA (AMGE-) AMGEN INC. 
XX 

PI Feige U, Liu C, Cheetham J, Boone TC; 
XX 

DR WPI; 2000-350702/30. 

DR N-PSDB; AAA69501. 
XX 

PT Novel composition of matter comprising an Fc domain and 

PT pharmacologically active peptides, useful for treating cancer and 

PT autoimmune diseases - 

XX 

PS Example 4; Page 568-569; 608pp ; English. 
XX 

CC The present invention describes composition of matter (I) comprising an 

CC Fc domain, pharmacologically active peptides, and linkers. Where (I) is: 

CC (Xl)a-Fl- (X2)b, where: Fl = an Fc domain; XI and X2 = are each 

CC independently selected from -(Ll)c-Pl, - (LI) c-Pl- (L2) d-P2 , 

CC - (Ll)c-Pl- (L2)d-P2- (L3)e-P A 3, or - (LI) c-Pl- (L2)d-P2- (L3) e-P3- (L4) f-P4 

CC where PI, P2, P3 , and P4 = are each independently sequences of 

CC pharmacologically active peptides; LI, L2, L3, and L4 = are each 

CC independently linkers; and a, b, c, d, e, and f = are each independently 

CC 0 or 1, provided that at least 1 of a and b is 1. The composition can 

CC have cytostatic, antiasthmatic, thrombolytic and immunosuppressive 

CC activities. DNAs, vectors and host cells from the present invention can 

CC be used for producing pharmaceutical compositions. The compositions are 

CC useful for treating cancer, asthma, thrombosis, or autoimmune diseases. 

CC The use of an Fc domain (rather than a Fab domain) can provide a longer 

CC half-life or incorporate functions such as Fc receptor binding, protein 



CC A binding, complement fixation, and possibly placental transfer. AAA69443 

CC to AAA69526 and AAB16955 to AAB18003 represent nucleotide and amino acid 

CC sequences used in the exemplification of the present invention. 
XX 

SQ Sequence 248 AA; 



Query Match 100.0%; Score 1238; DB 21; Length 248; 

Best Local Similarity 100.0%; Pred. No. 1.3e-91; 

Matches 228; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 



Qy 


l 


MDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFNWYV 


60 






1 1 1 III ! hill 1 Mllll ! IIE'III 1 1 lllllll II 




Db 


l 


MDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVWDVSHEDPEVKFNWYV 


60 


Qy 


61 


DGVEVHNAKTKPREEQYNSTYRWSVLTVLHQDWLNGKEYKCKVSNKALPAPI EKTI SKA 


120 






IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIMIIII 




Db 


61 


DGVEVHNAKTKPREEQYNSTYRWSVLTVLHQDWLNGKEYKCKVSNKALPAPI EKTI SKA 


120 


Qy 


121 


KGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLD 


180 






I II.: i 1 1 II. Mil ! 1 II Ml U .:l i 




Db 


121 


KGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLD 


180 


Qy 


181 


SDGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 228 








1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 I 1 1 1 1 1 1 




Db 


181 


SDGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 228 





Search completed: January 16, 2004, 15:57:06 
Job time : 42 sees 



GenCore version 5.1.6 
Copyright (c) 1993 - 2004 Compugen Ltd. 



OM protein - protein search, using sw model 



Run on: 



Title: 

Perfect score: 
Sequence : 

Scoring table: 



January 16, 2004, 15:56:15 ; Search time 22 Seconds 

(without alignments) 
438.494 Million cell updates/sec 

US-09-843-221A-2 
1238 

1 MDKTHTCPPCPAPELLGGPS MHEALHNHYTQKSLSLSPGK 228 



BLOSUM62 
Gapop 10.0 



328717 



Gapext 0 . 5 

Searched: 328717 seqs, 42310858 residues 

Total number of hits satisfying chosen parameters: 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post -processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 

Database : Issued_Patents_AA: * 

1: / cgn2_6 /p t o da t a / 1 / i a a / 5 A__COMB . p ep : * 

2 : /cgn2_6/ptodata/l/iaa/5B_COMB.pep: * 

3 : /cgn2_6/ptodata/l/ iaa/6A_C0MB . pep : * 

4 : /cgn2_6/ptodata/l/iaa/6B_COMB.pep: * 

5 : /cgn2_6/ptodata/l/iaa/PCTUS_COMB.pep: * 

6 : /cgn2_6/ptodata/l/iaa/backf ilesl .pep : * 

Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 

SUMMARIES 



Result 
No. 


Score 


Query 
Match 


Length 


DB 


ID 


Description 


1 


1234 


99 


.7 


229 


4 


US-09-122-144-2 


Sequence 2, Appli 


2 


1234 


99 


.7 


347 


1 


US-07-940-861-43 


Sequence 43, Appl 


3 


1234 


99 


.7 


347 


1 


US-08-459-512-43 


Sequence 43, Appl 


4 


1234 


99 


7 


347 


2 


US-08-459-657-43 


Sequence 43, Appl 


5 


1234 


99 


7 


347 


2 


US-08-460-132-43 


Sequence 43, Appl 


6 


1234 


99 


7 


347 


3 


US-08-466-465-8 


Sequence 8, Appli 


7 


1234 


99 


7 


347 


5 


PCT-US92-02050-43 


Sequence 43 , Appl 


8 


1234 


99 


7 


446 


4 


US-09-157-452B-12 


Sequence 12, Appl 


9 


1234 


99. 


7 


482 


4 


US-09-189-129-2 


Sequence 2, Appli 


10 


1233 


99. 


6 


232 


2 


US-08-595-043A-50 


Sequence 50, Appl 


11 


1233 


99. 


6 


235 


3 


US-09-131-247-6 


Sequence 6, Appli 



1 o 
Iz 


1233 


99 


. 6 


331 


3 


US -09-178-869-2 


Sequence 


2, Appli 


1 1 

lj 




99 


. 6 


331 


4 


US -09 -761-413 -2 


Sequence 


2, Appli 


14 


1 o o o 

1233 


99 


. 6 


360 


4 


US-09-180-100-H 


Sequence 


11, Appl 


lb 


1233 


99 


. 6 


371 


1 


US -08 -236-311-7 


Sequence 


7, Appli 


16 


1233 


99 


. 6 


371 


3 


US -08-457-918-7 


Sequence 


7, Appli 


1 / 


n o o "5 
12 J J 


y 9 


fZ 


376 


4 


US -09-180-100-22 


Sequence 


22, Appl 


1 Q 
1 O 


1 O O O 
1Z J J 


y y 


. b 


387 


1 


US - 08-470-299-4 


Sequence 


4, Appli 


i y 


T O O O 
12 J J 


y y 


. b 


388 


3 


US -09-131-247-16 


Sequence 


16, Appl 


2 U 


"1 O 1 "D 

1233 


99 


. 6 


38 9 


3 


US -09-131-24 7- 14 


Sequence 


14, Appl 


2 1 


1 O T "3 
12 J J 


99 


. 6 


396 


2 


US -08-784-5 12 -3 


Sequence 


3, Appli 


2Z 


1 *") T T 


y y 


. b 


3 96 


3 


US - 09-176-228-3 


Sequence 


3, Appli 


zo 


1 O "3 Q 


y y 


b 


424 


4 


US-09-333-593A-8 


Sequence 


8, Appli 


Z4 


1 O O "3 
12J J 


y y 


6 


424 


5 


PCT-US95 -03866 -12 


Sequence 


12, Appl 


2b 


1233 


99 


6 


424 


5 


PCT-US95-03866-14 


Sequence 


14, Appl 


ZD 


1 o o o 

1233 


99 


6 


437 


5 


PCT-US96- 10043 -11 


Sequence 


11, Appl 


2 / 


12 J J 


99 


6 


442 


5 


PCT-US96-10043-9 


Sequence 


9, Appli 


2 O 


"1 1 *3 "3 


y 9 


ez 

6 


446 


3 


US- 08 -3 97 -4 11 -7 


Sequence 


7, Appli 


z y 


12 JO 


99 


6 


449 


1 


US -08 -458-516-13 


Sequence 


13, Appl 


O U 


1 O O 1 
12 J J 


99 


6 


459 


1 


US-08-157-101A-7 


Sequence 


7, Appli 


o 1 


t n 'D o 
1233 


99 


6 


475 


4 


US -09-74 0-002 -27 


Sequence 


27, Appl 


JZ 


12 J J 


9 9 


6 


476 


2 


US -08-378- 93 9-10 


Sequence 


10, Appl 




"1 T T *3 
12 J J 


y y 


tz 
b 


476 


3 


US -08 -487-550-4 


Sequence 


4, Appli 


"3 A 


12 j J 


y y 


fZ 

b 


476 


3 


US -08 -487-550-12 


Sequence 


12, Appl 


JO 


12 J J 


99 . 


6 


476 


4 


US -09- 52 6-098-4 


Sequence 


4, Appli 


J D 


1 o o o 

1233 


99 . 


6 


476 


4 


US- 09 -526- 098 -12 


Sequence 


12, Appl 


J / 


1233 


99 , 


6 


478 


3 


US-08-487-550-8 


Sequence 


8, Appli 


J o 


1 

j j 


Q Q 


D 


A 7Q 


4 


TTC HQ COf AQD D 


Sequence 


8, Appli 


39 


1233 


99. 


6 


488 


3 


US-08-776-511-2 


Sequence 


2, Appli 


40 


1233 


99. 


6 


592 


4 


US-09-313-942-8 


Sequence 


8, Appli 


41 


1233 


99. 


6 


680 


3 


US-08-227-496C-15 


Sequence 


15, Appl 


42 


1233 


99. 


6 


691 


4 


US-09-313-942-20 


Sequence 


20, Appl 


43 


1233 


99. 


6 


694 


4 


US-09-313-942-22 


Sequence 


22, Appl 


44 


1233 


99. 


6 


859 


4 


US-09-313-942-7 


Sequence 


7, Appli 


45 


1233 


99. 


6 


951 


4 


US-09-313-942-9 


Sequence 


9, Appli 



ALIGNMENTS 



RESULT 1 
US-09-122-144-2 

; Sequence 2, Application US/09122144A 

; Patent No. 6485726 

; GENERAL INFORMATION: 

; APPLICANT: Blumberg, Richard S. 

; APPLICANT: Simister, Neil E. 

; APPLICANT: Lencer, Wayne I. 

; TITLE OF INVENTION: RECEPTOR SPECIFIC TRANSEPITHELIAL TRANSPORT OF 
THERAPEUTICS 

; FILE REFERENCE: S1383/7003 

; CURRENT APPLICATION NUMBER: US/09/122 , 144A 

CURRENT FILING DATE: 1998-07-24 
; EARLIER APPLICATION NUMBER: US 09/122,144 

EARLIER FILING DATE: 1998-07-24 

NUMBER OF SEQ ID NOS : 22 
; SEQ ID NO 2 

LENGTH: 22 9 



TYPE : PRT 

ORGANISM: Homo sapiens 
US-09-122-144-2 



Query Match 99.7%; Score 1234; DB 4; Length 229; 

Best Local Similarity 99.6%; Pred. No. 3.5e-117; 

Matches 227; Conservative 1; Mismatches 0; Indels 0; Gaps 0; 
QY 1 MDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCV^VDVSHEDPEVKFNWYV 60 

HIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMMIIIIIIIIIIII 

Db 2 VDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMI SRTPEVTCVWDVSHEDPEVKFNWYV 61 

QY 61 DGVEVHNAKTKPREEQYNSTYRWSVLTVLHQDWLNGKEYKCKVSNKALPAPI EKTI SKA 120 

IMIIIIMIIIMIIMIIIIIIIIIIIIIIIIIMIIIIIIIIIIIMIIIIIIIIII 

Db 62 DGVEVHNAKTKPREEQYNSTYRWSVLTVLHQDWLNGKEYKCKVSNKALPAPI EKTI SKA 121 

Qy 121 KGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLD 180 

II I M Mi M I II III II MINI I II III II Mill MMII Mill I MM III INI 

Db 122 KGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLD 181 

Qy 181 SDGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 228 

M I II II I II I II II I I I I II II I I II M I I I I II I I I II M I I I II I 
Db 182 SDGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 22 9 



RESULT 2 

US-07-940-861-43 

Sequence 43, Application US/07940861 
Patent No. 5547853 
GENERAL INFORMATION: 

APPLICANT: BIOGEN, INC. 
APPLICANT: WALLNER , Barbara P. 
APPLICANT: MILLER, Glenn T. 
APPLICANT: ROSA, Margaret D. 

TITLE OF INVENTION: CD2 -BINDING DOMAIN OF LYMPHOCYTE 
TITLE OF INVENTION: FUNCTION ASSOCIATED ANTIGEN 3 
NUMBER OF SEQUENCES: 43 
CORRESPONDENCE ADDRESS : 
ADDRESSEE: Fish & Neave 
STREET: 875 Third Avenue 
CITY: New York 
STATE: New York 
COUNTRY: U.S.A. 
ZIP: 10022-6250 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 
COMPUTER: IBM PC compatible 
OPERATING SYSTEM: PC-DOS/MS-DOS 
SOFTWARE: Patentln Release #1.0, Version #1.25 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/07/940 , 8 61 
FILING DATE: 21 -OCT- 19 92 
CLASSIFICATION: 514 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: PCT/US92/02 050 
FILING DATE: 12-MAR-1992 
PRIOR APPLICATION DATA: 



APPLICATION NUMBER: US 07/667,971 
FILING DATE: 12-MAR-1991 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: US 07/770,967 
FILING DATE: 07-OCT-1991 
ATTORNEY/AGENT INFORMATION: 
NAME: HALEY, James F. , Jr. 
REGISTRATION NUMBER: 27,794 
REFERENCE / DOCKET NUMBER: B151CIP2 
TELECOMMUNICATION INFORMATION: 
TELEPHONE: (212 ) 715-0600 
TELEFAX: (212) 715-0673 
TELEX: 14-8367 
INFORMATION FOR SEQ ID NO: 43: 
SEQUENCE CHARACTERISTICS: 
LENGTH: 347 amino acids 
TYPE: amino acid 
TOPOLOGY: linear 
MOLECULE TYPE: protein 
US-07-940-861-43 

Query Match 99.7%; Score 1234; DB 1; Length 347; 

Best Local Similarity 99.6%; Pred. No. 6.3e-117; 

Matches 227; Conservative 1; Mismatches 0; Indels 0; Gaps 0; 

MDKTHTCPPCPAPELLGGPS VFLFPPKPKDTLMI SRTPEVTCVWDVSHEDPEVKFNWYV 6 0 

Hlllllllllllllllllllllllllllllllliilliiiiiiiiiiiiiiiiiiiin 



Ml M MM MM Ml I MM MM Ml MMI! MMMM I! MMI 

DGVEVHNAKTKPREEQYNSTYRWSVLTVLHQDWLNGKEYKCKVSNKALPAP I EKTI SKi 
KGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVL] 

Ml MUM M MMI Ml III I MM MUM MMI IMMIIM 

KGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVL] 
SDGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 228 

I IMIMMMMMMM III I MM MMMM 



Qy 


1 


Db 


120 


Qy 


61 


Db 


180 


Qy 


121 


Db 


240 


Qy 


181 


Db 


300 



RESULT 3 

US-08-459-512-43 

Sequence 43, Application US/08459512 
Patent No. 5728677 
GENERAL INFORMATION: 

APPLICANT: BIOGEN, INC. 
APPLICANT: WALLNER, Barbara P. 
APPLICANT: MILLER, Glenn T. 
APPLICANT: ROSA, Margaret D. 

TITLE OF INVENTION: CD2 -BINDING DOMAIN OF LYMPHOCYTE 
TITLE OF INVENTION: FUNCTION ASSOCIATED ANTIGEN 3 
NUMBER OF SEQUENCES: 43 
CORRESPONDENCE ADDRESS: 
ADDRESSEE: Fish & Neave 
STREET: 875 Third Avenue 



CITY: New York 
STATE: New York 
COUNTRY: U.S.A. 
ZIP : 10022-6250 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 
COMPUTER: IBM PC compatible 
OPERATING SYSTEM: PC-DOS/MS -DOS 
SOFTWARE: Patentln Release #1.0, Version #1.25 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/08/459,512 
FILING DATE: 02-JUN-1995 
CLASSIFICATION: 514 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: PCT/US92/02050 
FILING DATE: 12-MAR-1992 
APPLICATION NUMBER: US 07/667,971 
FILING DATE: 12-MAR-1991 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: US 07/770,967 
FILING DATE: 07-OCT-1991 
ATTORNEY/AGENT INFORMATION: 
NAME: HALEY, James F., Jr. 
REGISTRATION NUMBER : 27,794 
REFERENCE / DOCKET NUMBER: B151CIP2 
TELECOMMUNICATION INFORMATION: 
TELEPHONE: (212)715-0600 
TELEFAX: ( 2 12 ) 715 -0673 
TELEX: 14-8367 
INFORMATION FOR SEQ ID NO: 43: 
SEQUENCE CHARACTERISTICS: 
LENGTH: 347 amino acids 
TYPE: amino acid 
TOPOLOGY: linear 
MOLECULE TYPE: protein 
US-08-459-512-43 

Query Match 99.7%; Score 1234; DB 1; Length 347; 

Best Local Similarity 99.6%; Pred. No. 6.3e-117; 

Matches 227; Conservative 1; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 MDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMI SRTPEVTCVWDVSHEDPEVKFNWYV 60 

M I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I | II | | | | | | | | 
Db 120 VDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFNWYV 179 

Qy 61 DGVEVHNAKTKPREEQYNSTYRWSVLTVLHQDWLNGKEYKCKVSNKALPAP I EKTI SKA 12 0 

IIIIIIIIIIIIMIIMIIIIMIIIMMIIIIIIIIIIIIIIIIIIIIIIIIIIMI 

Db 180 DGVEVHNAKTKPREEQYNSTYRWSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKA 239 

Qy 121 KGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLD 18 0 

IIIMIMMIMMIIIIIIIIIIIIIIIIIIIIIMIIIIMIIIIIIIIIIIIIIII 

Db 240 KGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLD 2 99 

Qy 181 SDGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 228 

IIIIMIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMII 

Db 3 00 SDGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 347 



RESULT 4 

US-08-459-657-43 

Sequence 43, Application US/08459657 
Patent No. 5914111 
GENERAL INFORMATION: 

APPLICANT: BIOGEN, INC. 
APPLICANT: WALLNER, Barbara P. 
APPLICANT: MILLER, Glenn T. 
APPLICANT: ROSA, Margaret D. 

TITLE OF INVENTION: CD2 -BINDING DOMAIN OF LYMPHOCYTE 
TITLE OF INVENTION: FUNCTION ASSOCIATED ANTIGEN 3 
NUMBER OF SEQUENCES: 43 
CORRESPONDENCE ADDRESS: 
ADDRESSEE: Fish & Neave 
STREET: 875 Third Avenue 
CITY: New York 
STATE: New York 
COUNTRY: U.S.A. 
ZIP: 10022-6250 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 
COMPUTER: IBM PC compatible 
OPERATING SYSTEM: PC-DOS/MS -DOS 
SOFTWARE: Patentln Release #1.0, Version #1.25 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/08/459,657 
FILING DATE: 02-JUN-1995 
CLASSIFICATION: 424 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: PCT/US92/02 050 
FILING DATE: 12 -MAR- 1992 
APPLICATION NUMBER: US 07/667,971 
FILING DATE: 12-MAR-1991 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: US 07/770,967 
FILING DATE: 07-OCT-1991 
ATTORNEY / AGENT INFORMATION: 
NAME: HALEY, James F., Jr. 
REGISTRATION NUMBER : 27,794 
REFERENCE/DOCKET NUMBER: B151CIP2 
TELECOMMUNI CATION INFORMATION : 
TELEPHONE: (212) 715-0600 
TELEFAX: (212) 715-0673 
TELEX: 14-8367 
INFORMATION FOR SEQ ID NO: 43: 
SEQUENCE CHARACTERISTICS: 
LENGTH: 347 amino acids 
TYPE: amino acid 
TOPOLOGY : 1 inear 
MOLECULE TYPE: protein 
US-08-459-657-43 

Query Match 99.7%; Score 1234; DB 2; Length 347; 

Best Local Similarity 99.6%; Pred. No. 6.3e-117; 

Matches 227; Conservative 1; Mismatches 0; Indels 0; Gaps 



Qy 1 MDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCWVDVSHEDPEVKFNWYV 60 

MMMMI > I I I Ml I Mill I MMMIMM MIMI Ml 

Db 120 VDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFNWYV 179 

Qy 61 DGVEVHNAKTKPREEQYNSTYRWSVLTVLHQDWLNGKEYKCKVSNKALPAPI EKTI SKA 120 

I Mill I IIIIIIII MINI MINIMI Mill I II MM II II III II I II I II II I 

Db 180 DGVEVHNAKTKPREEQYNSTYRWSVLTVLHQDWLNGKEYKCKVSNKALPAPI EKTI SKA 239 

Qy 121 KGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLD 180 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I II I I I II I I I I I I I II I I I I I I I I I I I I I 
Db 24 0 KGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLD 299 

Qy 181 SDGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 228 

MMI IIMIMMMMMI MMI Ml III MMI III 

Db 300 SDGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 347 

RESULT 5 

US-08-460-132-43 

Sequence 43, Application US/08460132 
Patent No. 5928643 
GENERAL INFORMATION: 

APPLICANT: BIOGEN, INC. 
APPLICANT: WALLNER , Barbara P. 
APPLICANT: MILLER, Glenn T. 
APPLICANT: ROSA, Margaret D. 

TITLE OF INVENTION: CD2 -BINDING DOMAIN OF LYMPHOCYTE 
TITLE OF INVENTION: FUNCTION ASSOCIATED ANTIGEN 3 
NUMBER OF SEQUENCES: 43 
CORRESPONDENCE ADDRESS: 
ADDRESSEE: Fish & Neave 
STREET: 875 Third Avenue 
CITY: New York 
STATE: New York 
COUNTRY: U.S.A. 
ZIP: 10022-6250 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 
COMPUTER: IBM PC compatible 
OPERATING SYSTEM: PC-DOS/MS-DOS 
SOFTWARE : Patent In Release #1.0, Version #1.25 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/08/460 , 132 
FILING DATE: 02-JUN-1995 
CLASSIFICATION: 424 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: US 07/940,861 
FILING DATE: 21-OCT-1992 
APPLICATION NUMBER: PCT/US92/02050 
FILING DATE: 12-MAR-1992 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: US 07/667,971 
FILING DATE: 12 -MAR- 1991 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: US 07/770,967 
FILING DATE: 07-OCT-1991 
ATTORNEY/AGENT INFORMATION: 



NAME: HALEY, James F., Jr. 
REGISTRATION NUMBER: 27,794 
REFERENCE / DOCKET NUMBER: B151CIP2 
TELECOMMUNICATION INFORMATION: 
TELEPHONE: (212 ) 715-0600 
TELEFAX: (2 12 ) 715 -0673 
TELEX: 14-8367 
INFORMATION FOR SEQ ID NO: 43: 
SEQUENCE CHARACTER I STI CS : 
LENGTH: 347 amino acids 
TYPE: amino acid 
TOPOLOGY: linear 
MOLECULE TYPE: protein 
US-08-460-132-43 

Query Match 99.7%; Score 1234/ DB 2; Length 347; 

Best Local Similarity 99.6%; Pred. No. 6.3e-117; 

Matches 227; Conservative 1; Mismatches 0; Indels 0; Gaps 0; 
QY 1 MDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMI SRTPEVTCVWDVSHEDPEVKFNWYV 60 

: 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

Db 120 VDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFNWYV 179 

QY 61 DGVEVHNAKTKPREEQYNSTYRWSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKA 120 

II MINIMI II I llll I MM I Mill 1 1 Ml II MINI III lllllll Mill IN 

Db 180 DGVEVHNAKTKPREEQYNSTYRWSVLTVLHQDWLNGKEYKCKVSNKALPAPI EKTI SKA 239 

Qy 121 KGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLD 180 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I II I I I I i I I I I I I I I 
Db 240 KGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLD 299 

Qy 181 SDGS FFL YS KLTVDKSRWQQGNVFS CS VMHEALHNH YTQKSLSLS PGK 228 

M I II 1 1 II II 1 1 II II I M I M II I II 1 1 II M 1 1 II II M II M M 

Db 300 SDGS FFLYS KLTVDKSRWQQGNVFS CSVMHEALHNHYTQKSLSLS PGK 347 



RESULT 6 
US-08-466-465-8 

; Sequence 8, Application US/08466465 

; Patent No. 6162432 

; GENERAL INFORMATION: 

APPLICANT: Wallner, Barbara P. 

APPLICANT: Cooper, Kevin D. 

TITLE OF INVENTION: Method of Prophylaxis or Treatment of Antigen 
TITLE OF INVENTION: Presenting Cell Driven Skin Conditions Using 
TITLE OF INVENTION: Inhibitors of the CD2/LFA-3 Interaction 
NUMBER OF SEQUENCES: 8 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: LAHIVE & COCKFIELD 

STREET: 60 State Street, Suite 510 

CITY: Boston 

STATE: Massachusetts 

COUNTRY : USA 

ZIP: 02109-1875 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 

COMPUTER: IBM PC compatible 



OPERATING SYSTEM: PC-DOS/MS -DOS 
SOFTWARE: Patent In Release #1.0, Version #1.25 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/08/466 , 465 
FILING DATE: 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: PCT/US92/ 08755 
FILING DATE: 06-OCT-1992 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: US 07/862,022 
FILING DATE: 12-APR-1992 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: US 07/770 , 969 
FILING DATE: 07-OCT-1991 
ATTORNEY/ AGENT I NFORMATI ON : 
NAME: Myers, Louis (PLM) 
REGISTRATION NUMBER: 35,965 
REFERENCE/DOCKET NUMBER: BGP-111CP 
TELECOMMUNICATION INFORMATION: 
TELEPHONE: (617) 227-7400 
TELEFAX: (617) 227-5941 
INFORMATION FOR SEQ ID NO: 8: 
SEQUENCE CHARACTERISTICS: 
LENGTH: 347 amino acids 
TYPE: amino acid 
TOPOLOGY: linear 
MOLECULE TYPE: protein 
US-08-466-465-8 

Query Match 99.7%; Score 1234; DB 3; Length 347; 

Best Local Similarity 99.6%; Pred. No. 6.3e-117; 

Matches 227; Conservative 1; Mismatches 0; Indels 0; Gaps 0; 
Qy 1 MDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFNWYV 60 

HIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIMIIIIIIIMIIIIII 

Db 120 VDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVWDVSHEDPEVKFNWYV 179 

Qy 61 DGVEVHNAKTKPREEQYNSTYRWSVLTVLHQDWLNGKEYKCKVSNKALPAPI EKTI SKA 120 

MM I M IMIIMI Ml MMIM III MM IMMMI MIMI I 

Db 180 DGVEVHNAKTKPREEQYNSTYRWSVLTVLHQDWLNGKEYKCKVSNKALPAPI EKTI SKA 239 

QY 121 KGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLD 18 0 

II I II I MINI III MMIM Ml Ml I MINI MINIMUM MINI II INN 

Db 24 0 KGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLD 299 

Qy 181 SDGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 228 

II IIMM'IIIII! IIIMIMMIIIIIIIIII Mill Ml 

Db 3 00 SDGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 347 



RESULT 7 

PCT-US92-02050-43 

; Sequence 43, Application PC/TUS9202050 
; GENERAL INFORMATION: 

APPLICANT: BIOGEN, INC. 

APPLICANT: WALLNER, Barbara P. 

APPLICANT: MILLER, Glenn T. 



APPLICANT: ROSA, Margaret D. 

TITLE OF INVENTION: CD2 -BINDING DOMAIN OF LYMPHOCYTE 
TITLE OF INVENTION: FUNCTION ASSOCIATED ANTIGEN 3 
NUMBER OF SEQUENCES : 43 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: Fish & Neave 

STREET: 875 Third Avenue 

CITY: New York 

STATE: New York 

COUNTRY: U.S.A. 

ZIP : 10022-6250 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 

COMPUTER: IBM PC compatible 

OPERATING SYSTEM: PC-DOS/MS -DOS 

SOFTWARE: Patentln Release #1.0, Version #1.25 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: PCT/US92/02050 

FILING DATE: 19920312 

CLASSIFICATION: 435 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: US 07/667,971 

FILING DATE: 12 -MAR- 1991 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: US 07/770,967 

FILING DATE: 07-OCT-1991 
ATTORNEY/AGENT INFORMATION: 

NAME: HALEY, James F . , Jr. 

REGISTRATION NUMBER: 27,794 

REFERENCE/DOCKET NUMBER: B151CIP2 
TELECOMMUNICATION INFORMATION: 

TELEPHONE: (212)715-0600 

TELEFAX: (212)715-0673 

TELEX: 14-8367 
; INFORMATION FOR SEQ ID NO: 43: 
SEQUENCE CHARACTERISTICS: 

LENGTH: 347 amino acids 

TYPE: AMINO ACID 

TOPOLOGY: linear 
MOLECULE TYPE: protein 
PCT-US92-02050-43 

Query Match 99.7%; Score 1234; DB 5; Length 347; 

Best Local Similarity 99.6%; Pred. No. 6.3e-117; 

Matches 227; Conservative 1; Mismatches 0; Indels 0; Gaps 0; 
Qy 1 MDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVWDVSHEDPEVKFNWYV 60 



Db 




Qy 



61 DGVEVHNAKTKPREEQYNSTYRWSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKA 120 



Db 




Qy 



121 KGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLD 180 



Db 




Qy 181 SDGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 228 

I ' ! I > I ^1 I I . . I I > ! ! : 1 1 1 1 1 1 1 
Db 300 SDGSFFLYSKLTVDKSRVJQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 347 



RESULT 8 

US-09-157-452B-12 

Sequence 12, Application US/09157452B 
Patent No. 6482409 
GENERAL INFORMATION: 
APPLICANT: Lobb, Roy R. 
APPLICANT: Burkly, Linda C. 

TITLE OF INVENTION: TREATMENT FOR INFLAMMATORY BOWEL DISEASE 
FILE REFERENCE: 10274-004003 
CURRENT APPLICATION NUMBER: US/09/157 , 452B 
CURRENT FILING DATE: 1998-09-21 
PRIOR APPLICATION NUMBER: US 08/950,660 
PRIOR FILING DATE: 1997-10-15 
PRIOR APPLICATION NUMBER: US 08/373,857 
PRIOR FILING DATE: 1995-01-18 
PRIOR APPLICATION NUMBER: US 08/284,603 
PRIOR FILING DATE: 1994-08-11 
PRIOR APPLICATION NUMBER: PCT/US93/00924 
PRIOR FILING DATE: 1993-02-02 
PRIOR APPLICATION NUMBER: US 07/835,139 
PRIOR FILING DATE: 1992-02-12 
NUMBER OF SEQ ID NOS : 16 

SOFTWARE: FastSEQ for Windows Version 3.0 
SEQ ID NO 12 
LENGTH: 446 
TYPE: PRT 

ORGANISM: Homo sapiens 
US-09-157-452B-12 

Query Match 99.7%; Score 1234; DB 4; Length 446; 

Best Local Similarity 99.6%; Pred. No. 9.1e-117; 

Matches 227; Conservative 1; Mismatches 0; Indels 0; Gaps 0; 

MDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMI SRTPEVTCVWDVSHEDPEVKFNWYV 6 0 

- 1 1 1 1 ' 1 1 J I J I l . i 1 1 I 1 : 1 1 , j l : I I 1 : 1 1 1 1 ^ I : 

VDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFNWYV 278 
DGVE VHNAKTKPREEQYNSTYRWSVLTVLHQDWLNGKEYKCKVSNKALPAPI EKTI SKA 12 0 

IIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIMIIIIIIIIIIIII 

DGVEVHNAKTKPREEQYNSTYRWSVLTVLHQDWLNGKEYKCKVSNKALPAP I EKTI SKA 3 3 8 

KGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLD 18 0 

I I I I I I I I I i I I I I I I I I I I I I I I I I I I II I I I . I I I I I I I I I I I I I I M I I I I I I I I I 
KGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLD 3 98 

SDGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 228 

IIMIMIIIIIIIIIIIIIIIIMIIIIIMIIIIIIIIIIIIIIII 



Qy 


1 


Db 


219 


Qy 


61 


Db 


279 


Qy 


121 


Db 


339 


Qy 


181 


Db 


399 



RESULT 9 



US-09-189-129-2 

Sequence 2, Application US/09189129 
Patent No. 6323027 
GENERAL INFORMATION: 

APPLICANT: Burkly, Linda C 
APPLICANT: Benjamin, Christopher D 
APPLICANT: Hession, Catherine A 
APPLICANT: Whitty, Adrian 

TITLE OF INVENTION: COMMON GAMMA CHAIN BLOCKING AGENTS 
NUMBER OF SEQUENCES: 17 
CORRESPONDENCE ADDRESS: 
ADDRESSEE: Biogen, Inc. 
STREET: 14 Cambridge Center 
CITY: Cambridge 
STATE : Massachusetts 
COUNTRY : USA 
ZIP: 02142 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 
COMPUTER: IBM PC compatible 
OPERATING SYSTEM: PC-DOS/MS -DOS 
SOFTWARE: Patent In Release #1.0, Version #1.30 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/09/189,129 
FILING DATE: 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: AO 06 PCT CIP 
FILING DATE: 09-MAY-1997 
APPLICATION NUMBER: 60/017,466 
FILING DATE: 10-MAY-1996 
ATTORNEY/AGENT I NFORMATI ON : 
NAME: Kaplan, Warren A. 
REGISTRATION NUMBER: 34,199 
REFERENCE/DOCKET NUMBER: AO 06 PCT CIP 
TELECOMMUNICATION INFORMATION : 
TELEPHONE: 617 679-2000 
TELEFAX: 617 679-2838 
INFORMATION FOR SEQ ID NO: 2: 
SEQUENCE CHARACTERISTICS: 
LENGTH: 4 82 amino acids 
TYPE: amino acid 
STRANDEDNESS : single 
TOPOLOGY: linear 
MOLECULE TYPE: 
HYPOTHETICAL: YES 
ANTI -SENSE: NO 
US-09-189-129-2 

Query Match 99.7%; Score 1234; DB 4; Length 482; 

Best Local Similarity 99.6%; Pred. No. le-116; 

Matches 227; Conservative 1; Mismatches 0; Indels 0; Gaps 0; 

QY 1 MDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCWVDVSHEDPEVKFNWYV 60 

I ' I I I I ' I I 1 I I ' I I M I I I I I ' I , I I | | | | |' | I i I I | , | | . | | | , | I ' ! : 

Db 255 VI) KTHTCPPCPAPELLGGPSVFLFPPKPKDTLM I SRTPEVTCVWDVSHEDPEVKFNWYV 314 



QY 61 DGVEVHNAKTKPREEQYNSTYRWSVLTVLHQDWLNGKEYKCKVSNKALPAPI EKTI SKA 120 



IIIIIIIIIMIIIIIIIIIIIIIMIIMIIIIIIIIMIIIIIIIIIIIIIIIIIIII 

Db 315 DGVEVHNAKTKPREEQYNSTYRWSVLTVLHQDWLNGKEYKCKVSNKALPAPI EKTISKA 374 

Qy 121 KGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLD 180 

IIIIIIMIIIIMIIIIIIMIIIMIIIMIIIIIIIIIIIIIIIIIIIIIIIMIII 

Db 375 KGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLD 434 

Qy 181 SDGSFFLYSKLTVTDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 228 

IIIIIIIIIMIIIIIIIIMIIIIIMIIIIMIMIIIIIIIIIM 

Db 435 SDGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 482 

RESULT 10 
US-08-595-043A-50 

Sequence 50, Application US/08595043A 
Patent No. 5935824 
GENERAL INFORMATION: 

APPLICANT: SGARLATO , GREGORY D . 
TITLE OF INVENTION: PROTEIN EXPRESSION SYSTEM 
NUMBER OF SEQUENCES: 90 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: MEDLEN & CARROLL 
STREET: 220 MONTGOMERY STREET, SUITE 2200 
CITY: SAN FRANCISCO 
STATE: CALIFORNIA 

COUNTRY: UNITED STATES OF AMERICA 
ZIP: 94104 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 
COMPUTER: IBM PC compatible 
OPERATING SYSTEM: PC-DOS/MS -DOS 
SOFTWARE: Patentln Release #1.0, Version #1.30 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/08/595 , 043A 
FILING DATE: 31-JAN-1996 
CLASSIFICATION: 435 
ATTORNEY/AGENT I N FORMAT I ON : 
NAME: CARROLL, PETER G. 
REGISTRATION NUMBER: 32,837 
REFERENCE/DOCKET NUMBER: SGAR- 00371 
TELECOMMUNICATION INFORMATION: 
TELEPHONE: (415) 705-8410 
TELEFAX: (415) 397-8338 
INFORMATION FOR SEQ ID NO: 50: 
SEQUENCE CHARACTERISTICS: 
LENGTH: 232 amino acids 
TYPE: amino acid 
TOPOLOGY: linear 
MOLECULE TYPE: protein 
US-08-595-043A-50 

Query Match 99.6%; Score 1233; DB 2; Length 232; 

Best Local Similarity 100.0%; Pred. No. 4.5e-117; 

Matches 227; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 
QY 2 DKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEOTCV^ 61 

II II II MINN II III II I Mill Mill Mill MINIMI Mill III MINN 



Db 6 DKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVWDVSHEDPEVKFNWYVD 65 

Qy 62 GVEVHNAKTKPREEQYNSTYRWSVLTVLHQDWLNGKEYKCKVSNKALPAPI EKTI SKAK 121 

IIIIIIMIIIIIIIIIIIIIIIIIIIMIMIIIIIIIIIIIIIIIIIIIIIIIIIMI 

Db 66 GVEVHNAKTKPREEQYNSTYRWSVLTVLHQDWLNGPCEYKCKVSNKALPAPI EKTI SKAK 125 

Qy 122 GQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDS 181 

IIIMIMIIIMIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIII 

Db 126 GQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDS 185 

Qy 182 DGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 228 

IIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIII 

Db 186 DGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 232 



RESULT 11 
US-09-13*l-247-6 

; Sequence 6, Application US/09131247 

; Patent No. 6294170 

; GENERAL INFORMATION: 

; APPLICANT: Boone, Thomas C. 

; APPLICANT: Hershenson, Susan 

; APPLICANT: Bevilacqua, Michael P. 

; APPLICANT: Collins, David S. 

; TITLE OF INVENTION: COMPOSITION AND METHOD FOR TREATING INFLAMMATORY 
; TITLE OF INVENTION: DISEASES 
; FILE REFERENCE: A-365F 

; CURRENT APPLICATION NUMBER: US/09/13 1 , 247 

; CURRENT FILING DATE: 1998-08-07 

; EARLIER APPLICATION NUMBER: 60/055,185 

; EARLIER FILING DATE: 1997-08-08 

; EARLIER APPLICATION NUMBER: PCT/US 97/02131 

; EARLIER FILING DATE: 1997-02-10 

; NUMBER OF SEQ ID NOS : 16 

; SOFTWARE: Patentln Ver. 2.0 

; SEQ ID NO 6 

LENGTH: 235 

TYPE: PRT 

ORGANISM: Human 
US-09-131-247-6 



Query Match 99.6%; Score 1233; DB 3; Length 235; 

Best Local Similarity 100.0%; Pred. No. 4.6e-117; 

Matches 227; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 
Qy 2 DKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCWVDVSHEDPEVKFNWYVD 61 

M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

Db 9 DKTHTCPPCPAPELLGGPSVFLFPPKPKDT^ISRTPEVTCVVVDVSHEDPEVKFNWYVD 68 

Qy 62 GVEVHNAKTKPREEQYNSTYRWSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKAK 121 

MMMIIMIMMIIIMIIIIIIIIIIIIIIIMIIIIIIIIMIIIIIIIIIIIII 

Db 69 GVEVHNAKTKPREEQYNSTYRWSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKAK 128 

Qy 122 GQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDS 181 

K . IIIIIMIIMIMIIIIIIIIIMIIIIIIMIIIIIIIIIIIIIIIIMIIIIIIIII 

Db 129 GQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDS 188 



Qy 182 DGS FFLYSKLTVDKSRWQQGNVFS CS VMHEALHNHYTQKSLSLS PGK 228 

I I I I II. Mil! i : I : 

Db 18 9 DGSFFLYSKLTVT)KSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 235 



RESULT 12 
US-09-178-869-2 

; Sequence 2, Application US/09178869B 

; Patent No. 6197294 

; GENERAL INFORMATION: 

; APPLICANT: Tao, Weng 

; APPLICANT: Wong, Shou 

; APPLICANT: Hickey, William F 

; APPLICANT: Hammang, Joseph P. 

; APPLICANT: Baetge, E. Edward 

; TITLE OF INVENTION: CELL SURFACE- INDUCED MACROPHAGE ACTIVATION 
; FILE REFERENCE: 17810-043 

; CURRENT APPLICATION NUMBER: US/09/178 , 869B 

; CURRENT FILING DATE: 1998-10-26 

; NUMBER OF SEQ ID NOS : 14 

; SOFTWARE: Patentln Ver. 2.0 

; SEQ ID NO 2 

LENGTH: 331 

TYPE: PRT 

ORGANISM: Homo sapiens 
US-09-178-869-2 

Query Match 99.6%; Score 1233; DB 3; Length 331; 

Best Local Similarity 100.0%; Pred. No. 7.5e-117; 

Matches 227; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 
Qy 2 DKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVWDVSHEDPEVKFNWYVD 61 

M III III II III I IMM Mill I MMM MIM IMIMI I I 

Db 105 DKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCWVDVSHEDPEVKFNWYVD 164 

QY 62 GVEVIINAKTKPREEQYNSTYRWSvIjTvIjHQDWLNGKEYKCKVSNKALPAPI EKTI SKAK 121 

M 1 1 1 1 1 1 1 1 1 1 ! i : 1 1 1 1 1 1 1 1 ! 1 1 1 1 1 1 1 1 ! I ' 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 i 

Db 165 GVEVHNAKTKPREEQYNSTYRWSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKAK 224 

QY 122 GQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDS 181 

IIMIIIIIIMIIIIIIIIIIIIIIIMIIIIIIIIIIIIMIIIIIIIIIIIIIIMI 

Db 225 GQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDS 284 

Qy 182 DGS FFLYSKLTVDKSRWQQGNVFSCS VMHEALHNHYTQKSLSLS PGK 228 

IIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIMIIIIIIIIIIMI 

Db 285 DGS FFLYSKLTVDKSRWQQGNVFSCS VMHEALHNHYTQKSLSLS PGK 331 



RESULT 13 
US-09-761-413-2 

; Sequence 2, Application US/09761413 

; Patent No. 6506891 

; GENERAL INFORMATION: 

; APPLICANT: Tao, Weng 

; APPLICANT: Wong, Shou 

; APPLICANT: Hickey, William F 

; APPLICANT: Hammang, Joseph P. 



; APPLICANT: Baetge, E . Edward 

; TITLE OF INVENTION: CELL SURFACE - I NDUCED MACROPHAGE ACTIVATION 

FILE REFERENCE: 17810-043 
; CURRENT APPLICATION NUMBER: US/09/761 , 4 13 
; CURRENT FILING DATE: 2001-01-16 

PRIOR APPLICATION NUMBER: US/09/178 , 869 

PRIOR FILING DATE: 1998-10-26 
; NUMBER OF SEQ ID NOS : 14 

SOFTWARE: Patentln Ver. 2.0 
; SEQ ID NO 2 

LENGTH: 331 
TYPE : PRT 

ORGANISM: Homo sapiens 
US-09-761-413-2 



Query Match 99.6%; Score 1233; DB 4; Length 331; 

Best Local Similarity 100.0%; Pred. No. 7.5e-117; 

Matches 227; Conservative 0; Mismatches 0; Indels 0; Gaps 0 
DKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFNWYVD 61 

IIIIIIIIIIMMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIII 

DKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFNWYVD 164 
GVEVHNAKTKPREEQYNSTYRWS VLTVLHQDWLNGKE YKCKVSNKALPAP I EKTI SKAK 121 

IIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIMMIIIIIIIIIIIIIMIIIII 

GVEVHNAKTKPREEQYNSTYRWS VLTVLHQDWLNGKE YKCKVSNKALPAP I EKTI SKAK 224 
GQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDS 181 

IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIMIIIIIIIIIII 

GQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDS 284 
DGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 228 

IIIIIIIIIIIIIIIIIIMIIIMIIIIIIIIIIIIIIIIIIIMI 

DGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 331 



Qy 


2 


Db 


105 


Qy 


62 


Db 


165 


Qy 


122 


Db 


225 


Qy 


182 


Db 


285 



RESULT 14 
US-09-180-100-11 

; Sequence 11, Application US/09180100 
; Patent No. 6306395 
; GENERAL INFORMATION: 

; APPLICANT: NAKAMURA, No. 6306395io 
; APPLICANT: NAGATA , Shigekazu 

; TITLE OF INVENTION: NOVEL Fas ANTIGEN DERIVATIVE 

FILE REFERENCE: 1110-207P 
; CURRENT APPLICATION NUMBER: US/ 09/18 0 , 100 

CURRENT FILING DATE: 1998-11-02 

EARLIER APPLICATION NUMBER: PCT/ JP97/01502 

EARLIER FILING DATE: 1997-05-01 
; NUMBER OF SEQ ID NOS: 25 

SOFTWARE : Patentln Ver . 2.0 
; SEQ ID NO 11 
LENGTH: 360 
TYPE : PRT 

ORGANISM: Homo sapiens 
US-09-180-100-11 



Query Match 99.6%; Score 1233; DB 4; Length 360; 

Best Local Similarity 100.0%; Pred. No. 8.4e-117; 

Matches 227; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 
QY 2 DKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFNWYVD 61 

1 ! 1 1 1 1 , 1 i 1 1 1 1 1 l;l I J I . [ 1 1 1 1 1 1 1 1 , j 1 1 1 1 ' 1 1 j 1 1 

Db 134 DKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVWDVSHEDPEVKFNWYVD 193 

QY 62 GVEVHNAKTKPREEQYNSTYRWSVLTVLHQDWLNGKEYKCKVSNKALPAPI EKTI SKAK 121 

IMIIIIIMIIIIIIIIIIIIIMIIIMIIIIIIIIIIIIIMIIIMIIIIIIIIII 

Db 194 GVEVHNAKTKPREEQYNSTYRWS VTjTVLHQDWLNGKEYKCKVSNKALPAPI EKTI SKAK 253 

Qy 122 GQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDS 181 

IIIIIIMIIIIMIIIIIMIIIIIIIIIIIIIIIIIIIMIIIIMMIIIIIMIII 

Db 254 GQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDS 313 

QY 182 DGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 228 

INI III MINI INN 1 1 MINIUM I MM I Mill II I Ml 

Db 314 DGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 360 

RESULT 15 
US-08-236-311-7 

Sequence 7, Application US/08236311 
Patent No. 5565335 
GENERAL INFORMATION: 

APPLICANT: Capon, Daniel J. 
APPLICANT: Gregory, Timothy J. 
TITLE OF INVENTION: Adheson Variants 
NUMBER OF SEQUENCES: 25 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: Genentech, Inc. 
STREET: 460 Point San Bruno Blvd 
CITY: South San Francisco 
STATE: California 
COUNTRY : USA 
ZIP: 94080 
COMPUTER READABLE FORM: 

MEDIUM TYPE: 5.25 inch, 360 Kb floppy disk 
COMPUTER: IBM PC compatible 
OPERATING SYSTEM: PC-DOS/MS-DOS 
SOFTWARE: patin (Genentech) 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/08/236 , 311 
FILING DATE: 02-MAY-1994 
CLASSIFICATION: 435 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: 07/936190 
FILING DATE: 26-AUG-1992 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: 07/842777 
FILING DATE: 18-FEB-1992 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: 07/250785 
FILING DATE: 28-SEP-1988 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: 07/104329 



FILING DATE: 02-OCT-1987 
ATTORNEY / AGENT INFORMATION: 
; NAME: Hasak, Janet E. 

REGISTRATION NUMBER: 28 # 616 

REFERENCE/DOCKET NUMBER: 444P1C2 
TELECOMMUNICATION INFORMATION: 

TELEPHONE: 415/225-1896 

TELEFAX: 415/952-9881 

TELEX: 910/371-7168 
; INFORMATION FOR SEQ ID NO: 7: 
SEQUENCE CHARACTERISTICS: 

LENGTH: 371 amino acids 

TYPE: amino acid 

TOPOLOGY: linear 
US-08-236-311-7 

Query Match 99.6%; Score 1233; DB 1; Length 371; 

Best Local Similarity 100.0%; Pred. No. 8.8e-117; 

Matches 227; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

2 DKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCWVDVSHEDPEVKFNWYVD 61 

Mill Ml I Mill IN Ml M I M Ml M I MINI Mill II I Mill II Mill I II 

145 DKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVWDVSHEDPEVKFNWYVD 204 
62 GVEVHNAKTKPREEQYNSTYRWSVLTVLHQDWLNGKEYKCKVSNKALPAPI EKTI SKAK 121 

IIIMIIIIIIIMIIIIIMIIIIMMIIIIIIIIIIIIIIIIIIIIIIIIIMIIII 

205 GVEVHNAKTKPREEQYNSTYRWSVLTVLHQDWLNGKEYKCKVSNKALPAPI EKTI SKAK 264 
122 GQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDS 181 

IMIIIIIIIIIIIIIIIIMIIIIIIIIIIIIMIIIIIIIIIIIIIIMIMIMIII 

265 GQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDS 324 
182 DGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 228 

I I I I ' I ' I : I . I ' i I Ml 1,1 ' I ' I ' I . I I i ' I ' | I i I 

325 DGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 371 



Search completed: January 16, 2004, 15:59:34 
Job time : 23 sees 



Qy 

Db 

Qy 
Db 

Qy 
Db 

Qy 
Db 



GenCore version 5.1.6 
Copyright (c) 1993 - 2004 Compugen Ltd. 



OM protein - protein search, using sw model 



Run on: 



Title: 

Perfect score: 
Sequence : 

Scoring table: 



January 16, 2004, 15:58:31 ; Search time 33 Seconds 

(without alignments) 
1412.779 Million cell updates/sec 

US-09-843-221A-2 
1238 

1 MDKTHTCPPCPAPELLGGPS MHEALHNHYTQKSLSLSPGK 228 

BLOSUM62 

Gapop 10.0 , Gapext 0.5 



Searched: 762491 seqs, 204481190 residues 

Total number of hits satisfying chosen parameters: 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 



762491 



Post -process ing : 



Database 



Minimum Match 0% 
Maximum Match 100% 
Listing first 45 summaries 

Published_Applications_AA: * 
1 : / cgn2_6 /p t oda t a / 1 /pubpaa /US 0 7_PUBCOMB . pep : * 
/cgn2_6/ptodata/l/pubpaa/PCT_NEW_PUB .pep : * 
/ cgn2_6/ptodata/l/pubpaa/US06_NEW_PUB .pep : * 
/cgn2_6/ptodata/l/pubpaa/US06_PUBCOMB . pep : * 
/cgn2_6/ptodata/l/pubpaa/US07_NEW_PUB .pep : * 
/ cgn2_6/ptodata/l/pubpaa/PCTUS_PUBCOMB .pep : 
/ cgn2_6/ptodata/l/pubpaa/US08_NEW_PUB . pep : * 
/cgn2_6/ptodata/l/pubpaa/US08_PUBCOMB.pep: * 
/ cgn2_6/ptodata/l/pubpaa/US09A_J>UBCOMB .pep : 
/cgn2_6/ptodata/l/pubpaa/US09B_PUBCOMB.pep 
/cgn2_6/ptodata/l/pubpaa/US09C_PUBCOMB.pep 
/ cgn2_6 /p t oda t a / 1 /pubpaa /US 0 9_NEW_PUB . pep : 
/ cgn2_6 /p t oda t a / 1 /pubpaa /US 1 0 A_PUBCOMB . p ep 
/cgn2_6/ptodata/l/pubpaa/US10B_PUBCOMB.pep 
/cgn2_6/ptodata/l/pubpaa/US10C_PUBCOMB.pep 
/cgn2_6/ptodata/l/pubpaa/US10_NEW_PUB.pep 
/ cgn2_6 /p t oda t a / 1 /pubpaa /US 6 0__NEW_PUB . pep 
/cgn2_6/ptodata/l/pubpaa/US60__PUBCOMB.pep 



2 
3 
4 
5 
6 
7 
8 
9 

10 
11 
12 
13 
14 
15 
16 
17 
18 



* 

: * 
: * 
* 

+ 

: * 
* 



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 



SUMMARIES 



Result 
No. 



Score 



% 

Query 

Match Length DB 



ID 



Description 
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Sequence 
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Sequence 


2, Appli 
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Sequence 
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12 


T IP TA 0/"A AA/** — 1 >~i 

US- 10 -269-806-32 


Sequence 


32, Appl 


—1 
/ 


n o 0 0 
XZ J t) 


t a n 
1UU 


. 0 


Zz8 


12 


US -10- 14 5-206-2 


Sequence 


2, Appli 


0 
0 


1 O O O 

XZ J ts 


"1 A A 
100 


. 0 


243 


12 


US -10-269-806-152 


Sequence 


152, App 


Q 

y 


1 *7 7 O 
IzJ £5 


inn 
1UU 


A 

. 0 


250 


12 


US -10 -2 69-6 95 -243 


Sequence 


243, App 


i n 


XZ-3 O 


inn 
1UU 


A 
- U 


O r a 

zo 0 


12 


US- 10 -4 10 -998 -243 


Sequence 


243, App 


J. X 


1 TOO 

XZ JO 


inn 


A 
. U 


zbz 


10 


US - 09-840-277-109 


Sequence 


109, App 


1 9 
XZ 


1 O 1 Q 
XZ J 0 


inn 
1 U U 


A 


4 01 


12 


US -09-38 9 -782 -8 


Sequence 


8, Appli 


1 T 


i 0 7 /i 


y y 


. 7 


22 9 


14 


US -10 -2 15-2 97 -2 


Sequence 


2, Appli 


X4 


xzj 4 


y y 


. / 


zz y 


15 


T TO 1 j"i '"111 - *^irtn -^i 

US- 10 -2 15-2 98 -2 


Sequence 


2, Appli 


1 c; 

X 3 


1Z.54 


Q Q 

y y 


. / 


z4 0 


lz 


T TO 1A 1 1 r f\ f~ 11-1 

US- 10 -145-2 06-111 


Sequence 


111, App 


lb 


1 9 9 A 


Q Q 

y y 


. / 


z4 O 


lz 


T TO 1A -| a r — /^i 

US- 10 -145 -2 06 -112 


Sequence 


112, App 


1 1 
x / 


1 O 7 A 


Q Q 

y y 


—l 
. / 


"7 /l Q 

z4 O 


lz 


T TO 1A 1 ji 1 — •'s f\ f l-l^ 

US -10-145-206-113 


Sequence 


113, App 


1 Q 


1 O O A 

xzo4 


y y 


. 7 


252 


12 


US-10-145-206-114 


Sequence 


114, App 


1 Q 

1 y 


1 O "3 A 


y y 


. 1 


O IT O 

z5z 


12 


US -10-145-2 06 -115 


Sequence 


115, App 


0 n 
z u 


19 9/1 


yy 


—1 
. I 


zbz 


lz 


US- 10 -145 -2 06 -116 


Sequence 


116, App 


9 1 
z X 


1 9 9 A 
lzJ4 


y y 


. / 


*7 C A 
ZOZ 


lz 


T TO 1A 1 /I r ^/■\/" -1 -I 1 

US -10-145-206-117 


Sequence 


117, App 


9 9 
z z 


1 9 9 A 
XZ 0 4 


Q Q 

y y 


■7 


ncn 
ZOZ 


lz 


I TC 1 1A 1 /i r OA/" -1-10 

Ub -10-145-206-118 


Sequence 


118, App 


0 9 
Zo 


IOTA 
1zj4 


y y 


. / 


252 


12 


US -10-145-206-119 


Sequence 


119, App 


Z *t 


1 9 9 A 


y y 


•7 


OCT 

zbz 


lz 


US-10-145-2 06-120 


Sequence 


120, App 


9 R 
z 0 


1 99A 


Q Q 

y y 


•7 


zbz 


1 *7 

lz 


T TC 1A 1 /I r OA/* lOI 

Ub -10-145-206-121 


Sequence 


121, App 


9 £ 
z 0 


1 9 9 A 


y y 


•7 
1 


OCT 

zbz 


lz 


T TO 1A 1 /I 1 — *""*/\/"- 

Ub- 10 -145 -206 -122 


Sequence 


122, App 


97 


1 9 9 A 


Q q 

y y 


■7 


0 £ 1 
z b 1 


lz 


T TC 1A /lie /"AO /" 

Ub -10-435-608-6 


Sequence 


6, Appli 


9 Q 
Z O 


1 9 9 A 


y y 


•7 


zoo 


lz 


T TO *1A *1AA j— r~ r~ n 

US-10-3 90-566-8 


Sequence 


8, Appli 


9 Q 

z y 


1 9 "3 A 
Iz J4 


y y 


*7 


ZOO 


lz 


T TC 1A 1AA 1— /- t 1— 

US -10-390-566-15 


Sequence 


15, Appl 


9 n 


1 O 1 A 
1Z.54 


y y 


•-7 
/ 


"~\ r~ s~ 
ZOO 


12 


US -10-390-566 -21 


Sequence 


21, Appl 




1 9 9 A 
XZ J4 


y y 


"7 


•7 tl 
ZOO 


lz 


T TO "l/\ "1 n A i — /~ f <n>n 

US -10 -3 90-566-28 


Sequence 


28, Appl 


7 9 
J Z 


lz o4 


y y 


•-7 
/ 


0 ci 0 

z 93 


12 


US -10-145-206- 12 3 


Sequence 


123, App 


9 9 


1 O "5 A 

xz ,34 


y y 


•-7 

/ 


O A 1 

293 


12 


US -10-145-206-124 


Sequence 


124, App 


9 A 
J 4 


1 9 9 A 
XZ J 4 


y y 


•7 




15 


US -10-115-1 92 -12 


Sequence 


12, Appl 


9 R 
J O 


1 9 9 A 
XZ J 4 


Q Q 

y y 


■7 


"3 /I "7 
J4 / 


y 


T TO AA "~7A/~ n 

US- 09 -796-033 -8 


Sequence 8, Appli 


J D 


1 9 9 A 
X Z O 4 


yy . 


•7 
1 


-34 / 


9 


US -09 -730-465-8 


Sequence 8, Appli 


37 


1234 


99 . 


7 


94 7 


1 9 


TTQ- 1 H -9 9Q - t!QQ 0 

uo-xu-jzy-oyy-o 


Sequence 


8, Appli 


38 


1234 


99. 


7 


347 


14 


US-10-091-236-17 


Sequence 


17, Appl 


39 


1234 


99. 


7 


347 


15 


US-10-091-313-7 


Sequence 


7, Appli 


40 


1234 


99. 


7 


347 


15 


US-10-091-268-7 


Sequence 


7, Appli 


41 


1234 


99. 


7 


360 


12 


US-10-390-566-7 


Sequence 


7, Appli 


42 


1234 


99. 


7 


360 


12 


US-10-390-566-14 


Sequence 


14, Appl 


43 


1234 


99. 


7 


360 


12 


US-10-390-566-20 


Sequence 


20, Appl 


44 


1234 


99. 


7 


360 


12 


US-10-390-566-27 


Sequence 


27, Appl 


45 


1234 


99. 


7 


367 


12 


US-10-390-566-6 


Sequence 


6, Appli 



ALIGNMENTS 



RESULT 1 
US-09-847-712-2 

; Sequence 2, Application US/09847712 

; Patent No. US20020090646A1 

; GENERAL INFORMATION: 

; APPLICANT: LIU, CHUAN-FA 



; APPLICANT: MARSHALL, WILLIAM S. 
; APPLICANT: REYNOLDS, ANGELA 

; TITLE OF INVENTION: CALCITONIN -RELATED MOLECULES 

FILE REFERENCE: A-684 
; CURRENT APPLICATION NUMBER: US/ 09/847 , 712 
; CURRENT FILING DATE: 2001-05-02 
; PRIOR APPLICATION NUMBER: 60/201,511 
; PRIOR FILING DATE: 2000-05-03 
; NUMBER OF SEQ ID NOS : 2 6 

SOFTWARE: Patentln version 3.1 
; SEQ ID NO 2 

LENGTH: 228 
TYPE : PRT 

ORGANISM: Homo sapiens 
US-09-847-712-2 

Query Match 100.0%; Score 1238; DB 9; Length 228; 

Best Local Similarity 100.0%; Pred. No. 1.9e-99; 

Matches 228; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 
Qy 1 MDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVWDVSHEDPEVKFNWYV 60 

MIMIIIIIIIIMIIIIIIIIIIMIIMIIIIIIMMIIIIIIIIIMMIIMM 

Db 1 MDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCWVDVSHEDPEVKFNWYV 60 

Qy 61 DGVEVHNAKTKPREEQYNSTYRWSVLTVljHQDWLNGKEYKCKVSNKALPAPIEKTISKA 120 

1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ll 1 1 

Db 61 DGVE VHNAKTKPREEQYNSTYRWS VLTVLHQDWLNGKE YKCKVSNKALPAP I EKTI SKA 12 0 

Qy 121 KGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLD 18 0 

MIIIIIIIIIIIIIIIIIIIIIIIIMIMIIMMIIIMIIIIIIIII MINIMI 

Db 121 KGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLD 18 0 

Qy 181 SDGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 228 

MIMIIIIIIIIIIIIIIIIIIIIMIIMIIIIIIIIIIIIIIMI 

Db 181 SDGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 228 



RESULT 2 
US-09-840-277-2 

; Sequence 2, Application US/09840277 

; Patent No. US20020168363A1 

; GENERAL INFORMATION: 

; APPLICANT: FEIGE, ULRICH 

; APPLICANT: KOHNO, TADAHIKO 

; APPLICANT: LACEY, DAVID LEE 

; APPLICANT: BOONE, THOMAS CHARLES 

; TITLE OF INVENTION: INTEGRIN/ADHESION ANTAGONISTS 
FILE REFERENCE: A-688A 

CURRENT APPLICATION NUMBER: US/09/840,277 
; CURRENT FILING DATE: 2001-08-14 
; PRIOR APPLICATION NUMBER: 60/198,919 

PRIOR FILING DATE: 2000-04-21 
; PRIOR APPLICATION NUMBER: 60/201,394 
; PRIOR FILING DATE: 2000-05-03 
; NUMBER OF SEQ ID NOS: 135 
; SOFTWARE: Patentln version 3.1 
; SEQ ID NO 2 



4 



LENGTH: 228 
TYPE : PRT 

ORGANISM: Homo sapiens 
US-09-840-277-2 



Query Match 100.0%; Score 1238; DB 10; Length 228; 

Best Local Similarity 100.0%; Pred. No. 1.9e-99; 
Matches 228; Conservative 0; Mismatches 0; Indels 



0; Gaps 



0; 



Qy 


l 


Db 


l 


Qy 


61 


Db 


61 


Qy 


121 


Db 


121 


Qy 


181 


Db 


181 



MDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFNWYV 60 

MIIIIIIMIIIIIMIIIIIIIIIIIMIIMIIIIIIIIIIIIIIIIIIIIIIIIII 

MDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVWDVSHEDPEVKFNWYV 60 
DGVEVHNAKTKPREEQYNSTYRWSVLTVLHQDWLNGKEYKCKVSNKAL PAP I EKTI SKA 120 

IIIIMIIIIIIIIIIIIIIIIIIIIMIIMIIIIIIIIIIIIIIIIIIIIIIIIIIII 

DGVEVHNAKTKPREEQYNSTYRWSVLTVLHQDWLNGKEYKCKVSNKALPAPI EKTISKA 12 0 
KGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLD 180 

MIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIM 

KGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLD 180 



I 1 I , I I I I I I I I I [ | | | | II I I M I I II I I I I I I I I I I M | | | | 



RESULT 3 

US-09-847-249A-2 

Sequence 2, Application US/09847249A 
Publication No. US20030032588A1 
GENERAL INFORMATION: 
APPLICANT: MARSHALL , WILLIAM S. 
APPLICANT: STARK, KEVIN LEE 
TITLE OF INVENTION: GLUCAGON ANTAGONIST 
FILE REFERENCE: A- 6 93 

CURRENT APPLICATION NUMBER: US/09/847,24 9A 
CURRENT FILING DATE: 2001-05-02 
PRIOR APPLICATION NUMBER: 60/201,436 
PRIOR FILING DATE: 2000-05-03 
NUMBER OF SEQ ID NOS : 80 
SOFTWARE: Patentln version 3.1 
SEQ ID NO 2 
LENGTH: 228 
TYPE: PRT 

ORGANISM: Homo sapiens 
US-09-847-249A-2 



Query Match 100.0%; Score 1238; DB 11; Length 228; 

Best Local Similarity 100.0%; Pred. No. 1.9e-99; 

Matches 22 8; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 
Qy 1 MDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVWDVSHEDPEVKFNWYV 60 

MIIIIMMIIIIIMIIIIIIIIIIIIIIIIIIIIIMMIIIMIIIIIIIMIMI 

1 MDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFNWYV 60 
Qy 61 DGVEVHNAKTKPREEQYNSTYRWSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKA 120 

MIIMIIIIIIIIMIMIIIIMIMIIMIIIIIIMIIIIIIIIIIIIIIIIIIII 



Db 


61 


Qy 


121 


Db 


121 


Qy 


181 


Db 


181 



61 DGVEVHNAKTKPREEQYNSTYRWSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKA 120 
KGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFY PSDIAVEWESNGQPENNYKTTPPVLD 180 

IIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIM 

KGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLD 180 
SDGS F FL YS KLT VDKS RWQQGN VF S CS VMHEALHNH YTQKS LS LS PGK 228 

Ml MM 1 1 III II Mill I MM I MM 1 1 III I llllll Mill II 

SDGS FFLYS KLTVDKSRWQQGNVFS CS VMHEALHNH YTQKSLSLS PGK 228 
RESULT 4 

US-09-843-221A-2 

Sequence 2, Application US/09843221A 
Publication No. US20030039654A1 
GENERAL INFORMATION : 
APPLICANT: KOSTENUI K , PAUL 
APPLICANT: LIU, CHUAN-FA 
APPLICANT: LACEY, DAVID LEE 

TITLE OF INVENTION: MODULATORS OF RECEPTORS FOR PARATHYROID HORMONE AND 
PARATHYROID HORMONE - 

TITLE OF INVENTION: RELATED PROTEIN 
FILE REFERENCE: A-665B 

CURRENT APPLICATION NUMBER: US/09/843 , 22 1A 
CURRENT FILING DATE: 2001-04-26 
PRIOR APPLICATION NUMBER: 60/266,673 
PRIOR FILING DATE: 2001-02-06 
PRIOR APPLICATION NUMBER: 60/214,860 
PRIOR FILING DATE: 2000-06-28 
PRIOR APPLICATION NUMBER: 60/200/053 
PRIOR FILING DATE: 2000-04-27 
NUMBER OF SEQ ID NOS : 170 
SOFTWARE: Patentln version 3.1 
SEQ ID NO 2 
LENGTH: 228 
TYPE: PRT 

ORGANISM: Homo sapiens 
US-09-843-221A-2 

Query Match 10 0.0%; Score 123 8; DB 11; Length 228; 

Best Local Similarity 100.0%; Pred. No. l.,9e-99; 

Matches 228; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 
Qy 1 MDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVWDVSHEDPEVKFNWYV 60 

Ml Mill IMIIIIIIIIIIIIIIIIIIIIIIMIIIIMIIIMIIIIIIIIIIIIII 

Db 1 MDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVWDVSHEDPEVKFNWYV 60 

Qy 61 DGVEVHNAKTKPREEQYNSTYRWSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKA 120 

M I I M M I I I I I I M I II M M I II II II II II I I II II I II I M II I I II I I II I I I I 
Db 61 DGVEVIiNAKTKPREEQYNSTYRWSVljTVLHQDWLNGKEYKCKVSNKALPAPI EKTI SKA 120 

Qy 121 KGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLD 180 

1 1 M 1 1 1 II I II II I M I II I M II II II II 1 1 1 1 II II II 1 1 1 II I II I II I II II I M 

Db 121 KGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLD 180 

Qy 181 S DGSFFLYSKLTVDKS RWQQGN VFSCS VMHEALHNH YTQKSLSLS PGK 228 

IMIIIMIIIIIMIIIIIIIIIIIIIMIIIIIIIIIIIIMMII 



Db 181 SDGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 228 



RESULT 5 

US-09-840-669B-2 

Sequence 2, Application US/09840669B 
Publication No. US20030040470A1 
GENERAL INFORMATION: 
APPLICANT: KOHNO, TADAHIKO 

TITLE OF INVENTION: APO-AI/AII PEPTIDE DERIVATIVES 
FILE REFERENCE: A-690 

CURRENT APPLICATION NUMBER: US/09/840 , 669B 
CURRENT FILING DATE: 2002-06-07 
PRIOR APPLICATION NUMBER: 60/198,920 
PRIOR FILING DATE: 2000-04-21 
NUMBER OF SEQ ID NOS : 11 
SOFTWARE: Patentln version 3.1 
SEQ ID NO 2 
LENGTH: 228 
TYPE: PRT 

ORGANISM: Homo sapiens 
US-09-840-669B-2 

Query Match 100.0%; Score 1238; DB 11; Length 228; 

Best Local Similarity 100.0%; Pred. No. 1.9e-99; 

Matches 228; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

1DKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVWDVSHEDPEVKFNWYV 60 

IIIIIIMIIIIIMIIIIIIIIMMIIIIMIIIIIIIIIIIIIIIIIIIIIIIIII 

1DKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVWDVSHEDPEVKFNWYV 60 
)GVEVHNAKTKPREEQYNSTYRWSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKA 12 0 

IIIIMMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIMIIIIIIIIIIIIIIII 

JGVEVHNAKTKPREEQYNSTYRWSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKA 12 0 
:GQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLD 18 0 

J 1 1 1 1 1 1 r 1 1 1 J 1 1 r 1 1 j 1 1 j 1 1 1 1 1 1 1 1 1 r r 1 1 1 1 1 1 1 1 1 j j 1 1 1 1 1 r i r r 1 1 1 1 1 1 1 

IGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDI AVEWESNGQPENNYKTTPPVLD 18 0 
IDGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 228 

MIIIIIIMIIIIIMIIIIIIIIIIIIIIIMMIIIIIIIIMI 



Qy 


l 


Db 


l 


Qy 


61 


Db 


61 


Qy 


121 


Db 


121 


Qy 


181 


Db 


181 



RESULT 6 

US-10-269-806-32 

; Sequence 32, Application US/10269806 

; Publication No. US20030176352A1 

; GENERAL INFORMATION: 

; APPLICANT : Min, Hosung 

; APPLICANT: Sitney, Karen 

; APPLICANT: Hartley, Cynthia 

; TITLE OF INVENTION: Peptides and Related Compounds Having Thrombopoietic 
Activity 

; FILE REFERENCE: A-750 

; CURRENT APPLICATION NUMBER: US/10/269 , 8 06 
; CURRENT FILING DATE: 2002-10-10 



; NUMBER OF SEQ ID NOS : 199 

SOFTWARE: Patentln version 3.1 
/ SEQ ID NO 32 

LENGTH : 228 

TYPE : PRT 

ORGANISM: Artificial Sequence 
FEATURE : 

OTHER INFORMATION: Synthesized Peptide Sequence 
US-10-269-806-32 



Query Match 100.0%; Score 1238; DB 12; Length 228; 

Best Local Similarity 100.0%; Pred. No. 1.9e-99; 

Matches 228; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 
Qy 1 MDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVWDVSHEDPEVKFNWYV 60 

I II I MMMMI Mill III III Mill Ml III Mill I Mill II III MIMIII I 

Db 1 MDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCWVDVSHEDPEVKFNWYV 60 



Qy 


61 


Db 


61 


Qy 


121 


Db 


121 


Qy 


181 


Db 


181 



DGVEVHNAKTKPREEQYNSTYRWSVLTVLHQDWLNGKEYKCKVSNKALPAPI EKTI SKA 120 

I M Ml MINI 1 1111 1 1 II III II III III Ml II III I II III II Ml MINIM I 

DGVEVHNAKTKPREEQYNSTYRWSVLTVLHQDWLNGKEYKCKVSNKALPAPI EKTI SKA 120 
KGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLD 180 

1 1 1 1 1 1 II 1 1 II I M 1 1 1 II M II II 1 1 II II M II I M 1 1 II II 1 1 1 1 1 II 1 1 1 1 M 1 1 

KGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLD 180 

SDGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 228 
I M I I II II II II I I I I I I II II I I II II II I II II I II I II II I M I 



RESULT 7 
US-10-145-206-2 

; Sequence 2, Application US/10145206 

; Publication No. US20030195156A1 

; GENERAL INFORMATION: 

; APPLICANT: MIN, HOSUNG 

; APPLICANT: HSU, HAILING 

; APPLICANT: ZIONG, FEI 

; TITLE OF INVENTION: PEPTIDES AND RELATED MOLECULES THAT BIND TO TALL-1 
; FILE REFERENCE: A-743 

; CURRENT APPLICATION NUMBER: US/10/145 , 206 
CURRENT FILING DATE: 2002-05-13 
PRIOR APPLICATION NUMBER: US 60/290,196 
; PRIOR FILING DATE: 2001-05-11 
; NUMBER OF SEQ ID NOS: 197 
; SOFTWARE: Patentln version 3.1 
; SEQ ID NO 2 

LENGTH: 228 
TYPE : PRT 

ORGANISM: Homo sapiens 
US-10-145-206-2 



Query Match 100.0%; Score 1238; DB 12; Length 228; 

Best Local Similarity 100.0%; Pred. No. 1.9e-99; 

Matches 228; Conservative 0; Mismatches 0; Indels 0; Gaps 



0; 



Qy 


1 


MDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFNWYV 


60 






i i i i i i i i i i i i i i i i i t i i i i t i i i i i i i i i i i i i i i t i i i i i i i t i i i i i i i i i i i i i 

II II 1 1 1 II 1 1 II 1 II II II 1 II II II II II II II II II 1 II II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




nU 

UD 


1 


MDKTHTLPPLPAPhiJLl^JICJIFbV^ , lJ^ , PPKPKIjTIjMI SRTPEVTCVVVDVSHEDPE VKFNWYV 


60 


Qy 


61 


DGVEVHNAKTKPREEQYNSTYRWSVLTVLHQDWLNGKEYKCKVSNKALPAP I EKTI SKA 


120 






i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i t i t i i i i i t > i i i i 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




UD 


61 


DGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDW I EKTI SKA 


120 


Qy 


121 


KGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLD 


180 






1 MIIIMIIIIIIII IMIIIMIIIMI IIIIIIIIMIIII IMIIIIIIII 




Db 


121 


KGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLD 


180 


Qy 


181 


SDGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 228 








1 II ii ! 1 II ! 1 LM'li 1 1! 




Db 


181 


SDGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 228 





RESULT 8 

US-10-269-806-152 

Sequence 152, Application US/10269806 
Publication No. US20030176352A1 
GENERAL INFORMATION: 
APPLICANT: Min, Hosung 
APPLICANT: Sitney, Karen 
APPLICANT: Hartley, Cynthia 

TITLE OF INVENTION: Peptides and Related Compounds Having Thrombopoietic 
Activity 

FILE REFERENCE: A-750 

CURRENT APPLICATION NUMBER: US/10/269 , 8 06 
CURRENT FILING DATE: 2002-10-10 
NUMBER OF SEQ ID NOS : 199 
SOFTWARE: Patentln version 3.1 
SEQ ID NO 152 
LENGTH: 243 
TYPE : PRT 

ORGANISM: Artificial Sequence 
FEATURE : 

OTHER INFORMATION: Synthesized Peptide Sequence 
US-10-269-806-152 

Query Match 100.0%; Score 1238; DB 12; Length 243; 

Best Local Similarity 100.0%; Pred. No. 2e-99; 

Matches 228; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

MDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMI S RT P E VTC VWDVS HED P EVKFNW YV 6 0 

IIMIIIIIIIMIIMIIIIIIIIIIIIIIIIIIIIMIIIIIMIIIIIIMIIIIII 

MDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVWDVSHEDPEVKFNWYV 60 
DGVEVHNAKTKPREEQYNSTYRWS VXiTVLHQDWLNGKEYKCKVSNKALPAP I EKTI SKA 12 0 

IIIIIIIMIIIIIMIIIMMIIIIMIIIIIMIIIIIIIIIIIIIIIIIMIIIII 



IIIMIIMMII 1 IMMIIIMM illlMIIIIIIIII 1 IIIIIIIIIIM 

jQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLD 
181 SDGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 228 



Qy 


l 


Db 


l 


Qy 


61 


Db 


61 


Qy 


121 


Db 


121 


Qy 


181 



I II I II MM 1 1 III I IIIIIII I II MM II Mill I II IIIIIII I 

Db 181 SDGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 228 

RESULT 9 

US-10-269-695-243 

Sequence 243, Application US/10269695 
Publication No. US20030229023A1 
GENERAL INFORMATION: 
APPLICANT: OLINER, JONATHAN DANIEL 
APPLICANT: MIN, HOSUNG 

TITLE OF INVENTION: SPECIFIC BINDING AGENTS OF HUMAN ANGIOPOIETIN-2 
FILE REFERENCE: A-801A 

CURRENT APPLICATION NUMBER: US/10/269 , 695 
CURRENT FILING DATE: 2002-10-10 
PRIOR APPLICATION NUMBER: US 60/414,155 
PRIOR FILING DATE: 2002-09-27 
PRIOR APPLICATION NUMBER: US 60/328,624 
PRIOR FILING DATE: 2001-10-11 
NUMBER OF SEQ ID NOS : 359 
SOFTWARE: Patentln version 3.1 
SEQ ID NO 243 
LENGTH: 25 0 
TYPE: PRT 

ORGANISM: Artificial Sequence 
FEATURE: 

OTHER INFORMATION: DNA encoding peptide capable of binding to Ang-2 
US-10-269-695-243 

Query Match 100.0%; Score 1238; DB 12; Length 250; 

Best Local Similarity 100.0%; Pred. No. 2.1e-99; 

Matches 228; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

QY 1 MDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCWVDVSHEDPEVKFNWYV 60 

M M I I I II I I I I I || M I I II II || I || II II II I I || || || I II II I II I || || I I II 
Db 1 MDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFNWYV 60 

QY 61 DGVEVHNAKTKPREEQYNSTYRWSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKA 120 

1 1 1 1 1 1 1 1 1 1 1 1 1 1! 1 1 1 1 1 II 1 1 II 1 1 1 1 1 1 M 1 1 1 1 1 1 1 II 1 1 II 1 1 1 1 1 1 1 M 1 1 1 1 

Db 61 DGVEVHNAKTKPREEQYNSTYRWSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKA 12 0 

Qy 12 1 KGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLD 180 

I M M II 1 1 1 II II II 1 1 1 1 II I II 1 1 1 II I M II II II II II 1 1 1 II II 1 1 1 1 II II II 

Db 121 KGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLD 180 

Qy 181 SDGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 228 

MMMMMMMMMMMMMIMMMMMMIMIIMM 

Db 181 SDGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 228 



RESULT 10 
US-10-410-998-243 

; Sequence 243, Application US/10410998 
; Publication No. US20030236193A1 
; GENERAL INFORMATION: 

; APPLICANT: OLINER, JONATHAN DANIEL 
; APPLICANT: MIN, HOSUNG 



; TITLE OF INVENTION: SPECIFIC BINDING AGENTS OF HUMAN ANGIOPOIETIN-2 

FILE REFERENCE: A-8 01A 
; CURRENT APPLICATION NUMBER: US/10/410 , 998 

CURRENT FILING DATE: 2003-04-09 
; PRIOR APPLICATION NUMBER: US/l 0/269 , 695 

PRIOR FILING DATE: 2002-10-10 
; PRIOR APPLICATION NUMBER: US 60/414,155 

PRIOR FILING DATE: 2002-09-27 
; PRIOR APPLICATION NUMBER: US 60/328,624 

PRIOR FILING DATE: 2001-10-11 
; NUMBER OF SEQ ID NOS : 359 
; SOFTWARE: Patentln version 3.1 
; SEQ ID NO 243 
LENGTH: 250 
TYPE: PRT 

ORGANISM: Artificial Sequence 
FEATURE : 

OTHER INFORMATION: DNA encoding peptide capable of binding to Ang-2 
US-10-410-998-243 



Query Match 100.0%; Score 1238; DB 12; Length 250; 

Best Local Similarity 100.0%; Pred. No. 2.1e-99; 

Matches 228; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 



Qy 


l 


MDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCvVVT)VSHEDPEVKFNWYV 

IMMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIllllMMIIIIIIMMIM 

MDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCWVDVSHEDPEVKFNWYV 


60 


Db 


l 


60 


Qy 


61 


DGVEVHNAKTKPREEQYNSTYRWSVLTVLHQDWLNGKEYKCKVSNKALPAPI EKTI SKA 

II MMIIMMIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIllllll 

DGVEVHNAKTKPREEQYNSTYR WSVLTVLHQDWLNGKEYKCKVSNKALPAPI EKTI SKA 


120 


Db 


61 


120 


Qy 


121 


KGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLD 

IIMMIIIMIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIMIMII 

KGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLD 


180 


Db 


121 


180 


Qy 


181 


SDGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 228 

MIIIIMIIIIIIIIIIIIIIIIMIMIIIIIIIIIIMIIIIIII 

SDGSFFLYSKLT VDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 228 




Db 


181 





RESULT 11 
US-09-840-277-109 

; Sequence 109, Application US/09840277 

; Patent No. US20020168363A1 

; GENERAL INFORMATION: 

; APPLICANT: FEIGE, ULRICH 

; APPLICANT: KOHNO, TADAHIKO 

; APPLICANT: LACEY, DAVID LEE 

; APPLICANT: BOONE, THOMAS CHARLES 

; TITLE OF INVENTION: INTEGRIN/ADHESION ANTAGONISTS 
; FILE REFERENCE: A-688A 

; CURRENT APPLICATION NUMBER: US/09/840,277 

; CURRENT FILING DATE: 2001-08-14 

; PRIOR APPLICATION NUMBER: 60/198,919 

PRIOR FILING DATE: 2000-04-21 
; PRIOR APPLICATION NUMBER: 60/201,3 94 



PRIOR FILING DATE: 2000-05-03 
; NUMBER OF SEQ ID NOS : 135 
; SOFTWARE: Patentln version 3.1 
; SEQ ID NO 109 

LENGTH: 282 

TYPE : PRT 

ORGANISM: Artificial Sequence 
FEATURE : 

OTHER INFORMATION: Echistatin Fc -peptide 
NAME /KEY : misc_feature 
LOCATION: (1) . . (1) 
OTHER INFORMATION: Ndel site 
NAME/ KEY : misc_f eature 
LOCATION: (854).. (854) 
OTHER INFORMATION: BamHI site 
US-09-840-277-109 



Query Match 100.0%; Score 1238; DB 10; Length 282; 

Best Local Similarity 100.0%; Pred. No. 2.5e-99; 

Matches 228; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 
QY 1 MDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVWDVSHEDPEVKFNWYV 60 

IIIIIIIIIIIIIMIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIM 

Db 1 MDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTC^VVDVSHEDPEVKFNWYV 60 

QY 61 DGVEVHNAKTKPREEQYNSTYRWSVLTVLHQDWLNGKEYKCKVSNKALPAP I EKTI SKA 120 

IMIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIMIIIIIIIII 

Db 61 DGVEVHNAKTKPREEQYNSTYRWSVLTVLHQDWLNGKEYKCKVSNKALPAPI EKTI SKA 12 0 

Qy 121 KGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLD 180 

IMMMIIIIMIIMIIMIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIM 

Db 121 KGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLD 180 

Qy 181 SDGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 228 

IMIIMIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIMIMII 

Db 181 SDGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 228 



RESULT 12 
US-09-389-782-8 

; Sequence 8, Application US/0938 9782 

; Publication No. US20030144187A1 

; GENERAL INFORMATION: 

; APPLICANT: Wooden, Scott K. 

; APPLICANT: Mann, Michael B. 

; APPLICANT: Dunstan, Colin R. 

; TITLE OF INVENTION: OPG Fusion Protein Compositions and Methods 
; FILE REFERENCE: A-604 

; CURRENT APPLICATION NUMBER: US/09/38 9 , 782 

; CURRENT FILING DATE: 1999-09-03 

; NUMBER OF SEQ ID NOS: 50 

; SOFTWARE: Patentln Ver. 2.1 

; SEQ ID NO 8 

LENGTH: 4 01 

TYPE : PRT 

ORGANISM: Human 
US-09-389-782-8 



Query Match 100.0%; Score 1238; DB 12; Length 401; 

Best Local Similarity 100.0%; Pred. No. 3.8e-99; 

Matches 228; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 
Qy 1 MDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFNWYV 60 

IMIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIII 

Db 1 MDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMI SRTPEVTCVWDVSHEDPEVKFNWYV 60 

QY 61 DGVEVHNAKTKPREEQYNSTYRWSVLTVLHQDWLNGKEYKCKVSNKALPAPI EKTI SKA 120 

IIIIIIIIIIMIMIIIIIIIIIIIIIMIIIIMIIIIIIIIIIIIIIMIIIIIIII 

Db 61 DGVEWNAKTKPREEQYNSTYRWSVIjTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKA 120 

Qy 121 KGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLD 18 0 

MMMMMIMMMIMMIIMMMIMMIMMMMMIMMMMIMM 

Db 121 KGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLD 180 

QY 181 SDGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 228 

MM I II Ml Ml Mill III II I IIIIIII IIIIIII IIIIIII III 

Db 181 SDGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 228 



RESULT 13 
US-10-215-297-2 

; Sequence 2, Application US/10215297 

; Publication No. US20020192222A1 

; GENERAL INFORMATION: 

; APPLICANT: Blumberg, Richard S. 

; APPLICANT: Simister, Neil E. 

; APPLICANT: Lencer, Wayne I. 

; TITLE OF INVENTION: RECEPTOR SPECIFIC TRANSEPITHELIAL TRANSPORT OF 
; TITLE OF INVENTION: THERAPEUTICS 
; FILE REFERENCE: S1383/7003 

; CURRENT APPLICATION NUMBER: US/10/215 , 297 

CURRENT FILING DATE: 2002-08-08 
; PRIOR APPLICATION NUMBER: US 08/578,171 
; PRIOR FILING DATE: 1995-12-29 
; PRIOR APPLICATION NUMBER: US 08/374,159 

PRIOR FILING DATE: 1995-01-17 
; PRIOR APPLICATION NUMBER: US 09/122,144 
; PRIOR FILING DATE: 1998-07-24 
; NUMBER OF SEQ ID NOS : 22 
; SEQ ID NO 2 

LENGTH: 22 9 
TYPE: PRT 

ORGANISM: Homo sapiens 
US-10-215-297-2 



Query Match 99.7%; Score 1234; DB 14; Length 229; 

Best Local Similarity 99.6%; Pred. No. 4.2e-99; 

Matches 227; Conservative 1; Mismatches 0; Indels 0; Gaps 0; 
Qy 1 MDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMI SRTPEVTCVWDVSHEDPEVKFNWYV 60 

HIMIIIIIIIIIIIIIIIIIIllllllllllliiiMiiiMiiiiiiiiiiiiiiii 

Db 2 VTDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEWCVWDVSHEDPEVKFNWYV 61 

Qy 61 DGVEVHNAKTKPREEQYNSTYRWSVLTVLHQDWLNGKEYKCKVSNKALPAPI EKTI SKA 12 0 



II I II I M l II Ml MINIM II I III II MM 1 1 II II Ml III II MINIM I Ml 

Db 62 DGVEVHNAKTKPREEQYNSTYRWSVLTVLHQDWLNGKEYKCKVSNKALPAPI EKTI SKA 121 

Qy 121 KGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLD 180 

MillllMlllllllllllli lllllll IMIIII Mlllll IMIIIIIMIM 

Db 122 KGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLD 181 

QY 181 SDGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 228 

IIIIIIIIIIIIIIIIIMMIIIIIIIIIIMIIIIIIIIIIIIIII 

Db 182 SDGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 229 



RESULT 14 
US-10-215-298-2 

; Sequence 2, Application US/10215298 

; Publication No. US20030012789A1 

; GENERAL INFORMATION: 

; APPLICANT: Blumberg, Richard S. 

; APPLICANT: Simister, Neil E . 

; APPLICANT: Lencer, Wayne I. 

; TITLE OF INVENTION: RECEPTOR SPECIFIC TRANS E P I THEL I AL TRANSPORT OF 
THERAPEUTICS 

; FILE REFERENCE: S1383/7003 

; CURRENT APPLICATION NUMBER: US/10/215, 298 

CURRENT FILING DATE: 2002-08-08 
; PRIOR APPLICATION NUMBER: US 09/122,144 

PRIOR FILING DATE: 1998-07-24 
; NUMBER OF SEQ ID NOS : 22 
; SEQ ID NO 2 
LENGTH: 22 9 
TYPE: PRT 

ORGANISM: Homo sapiens 
US-10-215-298-2 

Query Match 99.7%; Score 1234; DB 15; Length 22 9; 

Best Local Similarity 99.6%; Pred. No. 4.2e-99; 

Matches 227; Conservative 1; Mismatches 0; Indels 0; Gaps 0; 
Qy 1 MDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVAA^VSHEDPEVKFNWYV 60 

HlllllllllllllllllliiiiiiiiiiiiiiiiiMiiiiiiiiiiiiiiiiiiiii 

Db 2 VDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFNWYV 61 

Qy 61 DGVEVHNAKTKPREEQYNSTYR WSVLTVLHQDWLNGKEYKCKVSNKALPAPI EKTI SKA 120 

IIIMIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIII 

Db 62 DGVEVHNAKTKPREEQYNSTYRWSVLTVLHQDWLNGKEYKCKVSNKALPAPI EKTI SKA 121 

Qy 121 KGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLD 180 

IIMMIIMIMIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIMIIIIIIII 

Db 122 KGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLD 181 

Qy 181 SDGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 228 

IIIMIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIMII 

Db 182 SDGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 22 9 



RESULT 15 
US-10-145-206-111 



Sequence 111, Application US/10145206 
Publication No. US20030195156A1 
GENERAL INFORMATION: 
APPLICANT: MIN # HOSUNG 
APPLICANT: HSU, HAILING 
APPLICANT: ZIONG, FEI 

TITLE OF INVENTION: PEPTIDES AND RELATED MOLECULES THAT BIND TO TALL-1 
FILE REFERENCE: A- 743 

CURRENT APPLICATION NUMBER: US/10/145 , 206 
CURRENT FILING DATE: 2002-05-13 
PRIOR APPLICATION NUMBER: US 60/290,196 
PRIOR FILING DATE: 2001-05-11 
NUMBER OF SEQ ID NOS : 197 
SOFTWARE: Patentln version 3.1 
SEQ ID NO 111 
LENGTH: 24 8 
TYPE: PRT 

ORGANISM: Artificial Sequence 
FEATURE : 

OTHER INFORMATION: TALL-1 inhibitory peptibodies 
US-10-145-206-111 

Query Match 99.7%; Score 1234; DB 12; Length 248; 

Best Local Similarity 99.6%; Pred. No. 4.6e-99; 

Matches 227; Conservative 1; Mismatches 0; Indels 0; Gaps . 0; 
Qy 1 MDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFNWYV 60 

HIIIMIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIMIIIIIIIIIIIII 

Db 21 VDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFNWYV 8 0 

Qy 61 DGVEVHNAKTKPREEQYNSTYRWSVLTVliHQDWLNGKEYKCKVSNKALPAPI EKTI SKA 120 

IIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIMIIMIIIIIIIIIIIIIIIIIIMII 

Db 81 DGVEVHNAKTKPREEQYNSTYRWSVLTVLHQDWLNGKEYKCKVSNKALPAPI EKTI SKA 14 0 

Qy 121 KGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLD 180 

IMIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIII 

Db 141 KGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLD 2 00 

Qy 181 SDGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 228 

IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIII 

Db 201 SDGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 24 8 



Search completed: January 16, 2004, 16:04:07 
Job time : 34 sees 



Gen Core version 5.1.6 
Copyright (c) 1993 - 2004 Compugen Ltd. 



OM protein - protein search, using sw model 



Run on: January 16, 2004, 15:55:25 ; Search time 20 Seconds 

(without alignments) 
1096.323 Million cell updates/sec 

Title: US-09-843-221A-2 
Perfect score: 1238 

Sequence: 1 MDKTHTCPPCPAPELLGGPS MHEALHNHYTQKSLSLSPGK 228 



Scoring table: BLOSUM62 

Gapop 10.0 , Gapext 0.5 



Searched : 



283308 seqs, 96168682 residues 



Total number of hits satisfying chosen parameters: 283308 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post -processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 



Database : PIR_76:* 

1: pirl:* 

2: pir2:* 

3: pir3:* 

4: pir4:* 



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 



SUMMARIES 

% 

Result Query 

No. Score Match Length DB ID Description 



1 


1233 


99 


.6 


255 


4 


S31866 


ig 


gamma -1 chain C 


2 


1233 


99 


.6 


330 


1 


GHHU 


ig 


gamma -1 chain C 


3 


1227 


99 


.1 


374 


2 


S69339 


ig 


heavy chain V r 


4 


1180 


95 


.3 


234 


2 


PT0207 


ig 


gamma chain C r 


5 


1146 


92 


6 


377 


2 


A23511 


ig 


gamma -3 chain C 


6 


1144 


92 


4 


377 


2 


A60764 


ig 


gamma -3 chain C 


7 


1142.5 


92 


3 


326 


1 


G2HU 


ig 


gamma -2 chain C 


8 


1135 


91 


7 


327 


1 


G4HU 


ig 


gamma -4 chain C 


9 


1121 


90 


5 


289 


1 


G3HUWI 


ig 


gamma -3 heavy c 


10 


918.5 


74 


2 


323 


1 


GHRB 


ig 


gamma chain C r 


11 


906.5 


73 


2 


328 


2 


147160 


Ig 


gamma 2b chain 


12 


906.5 


73. 


2 


328 


2 


147159 


ig 


gamma 2a chain 


13 


903 .5 


73. 


0 


277 


2 


147162 


ig 


gamma 4 chain c 



14 


889 


71 


. 8 


329 


1 


G2GP 


Ig gamma -2 chain C 


15 


885 . 5 


71 


. 5 


328 


2 


147158 


Ig gamma 1 chain c 


16 


878 . 5 


71 


. 0 


328 


2 


147161 


Ig gamma 3 chain c 


17 


855 . 5 


69 


. 1 


470 


2 


S22080 


Ig heavy chain pre 


18 


846 


68 


.3 


308 


2 


C30554 


Ig heavy chain C r 


19 


846 


68 


.3 


472 


2 


S31459 


Ig gamma -1 chain - 


20 


845 . 5 


68 


.3 


329 


1 


G3MSC 


Ig gamma -3 chain C 


21 


838 


67 


. 7 


333 


2 


PS0018 


Ig gamma -2b chain 


22 


834 . 5 


67 


.4 


398 


1 


G3MSM 


Ig gamma -3 chain C 


23 


827 . 5 


66 


. 8 


444 


2 


PC4436 


monoclonal ant i bod 


24 


818 . 5 


66 


. 1 


326 


2 


PS0017 


Ig gamma -1 chain C 


25 


817 . 5 


66 


. 0 


324 


1 


G1MS 


Ig gamma -1 chain C 


26 


812 . 5 


65 


. 6 


393 


1 


G1MSM 


Ig gamma -1 chain C 


27 


809 . 5 


65 


4 


329 


2 


S00847 


Ig gamma -2c chain 


28 


809 


65 


3 


330 


1 


G2MSA 


Ig gamma -2a chain 


29 


809 


65 


3 


469 


2 


S37483 


Ig gamma-2a chain 


30 


804 


64 


9 


399 


1 


G2MSAM 


Ig gamma -2a chain 


31 


802 


64 


8 


335 


1 


G2MSAB 


Ig gamma -2a chain 


32 


794 


64 


1 


446 


2 


S40295 


Ig gamma -2a chain 


33 


785 . 5 


63 


4 


322 


2 


PS0019 


Ig gamma-2a chain 


34 


779 


62 


9 


474 


1 


G2MS11 


Ig gamma -2b chain 


35 


774 


62 


5 


405 


1 


G2MSBM 


Ig gamma -2b chain 


36 


764 


61 


7 


327 


2 


S06611 


Ig gamma -2 chain C 


37 


757 


61 


1 


475 


2 


S01321 


Ig gamma -2b chain 


1 Q 
J O 


/u / 


b / 


1 


Ton 
loU 


A 


146732 


Ig gamma heavy cha 


39 
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S69340 


Ig heavy chain VHI 


40 


574 . 5 


46 


4 


218 


2 


A36040 


Ig heavy chain V-I 
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28 
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2 
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ALIGNMENTS 



RESULT 1 
S31866 

Ig gamma-1 chain C region - synthetic 
C; Species: synthetic 

A;Note: Homo sapiens (man) gene engineered and expressed in Escherichia coli 
C;Date: 06-Jan-1995 #sequence_revision 17-Mar-1997 #text_change ' 19-May-2000 
C; Accession: S3 1866 
R;Filpula, D. 

submitted to the EMBL Data Library, February 1993 

A; Description: Screeing method for protein-protein interactions of cloned gene 
products . 

A; Reference number: S3 18 66 
A;Accession: S31866 
A; Molecule type: mRNA 
A;Residues: 1-255 <FIL> 

A; Cross-references: EMBL:X70421 ; NID:g33068; PIDN : CAA4 9866 . 1 ; PID:g33069 
C ; Keywords : immunoglobul in 

F; 1-22 /Region : Escherichia coli outer membrane protein A precursor 
F; 23 -25 5 /Region : human Ig gamma-1 chain C region 



Query Match 99.6%; Score 1233; DB 4; Length 255; 

Best Local Similarity 100.0%; Pred. No. 6.8e-89; 

Matches 227; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 
Qy 2 DKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCvWDVSHEDPEVKFNWYVD 61 

'MINIUM I I llllli! MM! M Mill HIM, |, MM I 

Db 2 9 DKTHTCPPCPAPELI^GPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFNWYVD 88 

QY 62 GVEVHNAKTKPREEQYNSTYRWS VIjTvLHQDWLNGKEYKCKVSNKALPAPI EKTI SKAK 121 

MMMIMMMMMMMIMMMMMMMMMMMMMMIMMIMM 

Db 89 GVEVHNAKTKPREEQYNSTYRWSVXjTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKAK 148 



Qy 122 GQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVXiDS 181 

Ml Mill MINI I II I II III II I Mill I III II II I MINI MM III I MUM 

Db 149 GQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDS 208 

Qy 182 DGS F FL YS KLTVDKSRWQQGNVFS CS VMHEALHNH YTQKSLS LS PGK 228 

MIMMMMMMMMMMMMMMMIMMMMMMI 

Db 209 DGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 255 



RESULT 2 
GHHU 

Ig gamma-1 chain C region - human 
C; Species: Homo sapiens (man) 

C;Date: 31-Jan-1981 #sequence_revision 18-Aug-1982 #text_change 16-Jul-1999 
C;Accession: A93433; S36861; S33887; B90563; A90564; B91668; A91723; A02146 
R;Ellison, J.W. ; Berson, B.J.; Hood, L.E. 
Nucleic Acids Res. 10, 4071-4079, 1982 

A; Title: The nucleotide sequence of a human immunoglobulin C-gammal gene. 

A; Reference number: A93433; MUID: 82274238 ; PMID:6287432 

A;Accession: A93433 

A; Molecule type: DNA 

A; Residues: 1-330 <ELL> 

A; Cross-references : EMBL : Z17370 

A;Note: this sequence has the Glm(17) allotypic marker, 97-Lys, and the Glm(l) 
markers, 23 9 -Asp and 241 -Leu 

A;Note: Lys-330 is removed after translation 
R; Harris, L.J. 

submitted to the EMBL Data Library, October 1992 

A;Reference number: S33904 

A; Access ion: S3 68 61 

A; Molecule type: DMA 

A; Residues: 2-33 0 <HAR> 

A; Cross-references : EMBL: Z17370 

R;Takahashi, N. ; Ueda, S. ; Obata, M. ; Nikaido, T. ; Nakai, S. ; Honjo, T. 
Cell 29, 671-679, 1982 

A;Title: Structure of human immunoglobulin gamma genes: implications for 
evolution of a gene family. 

A;Reference number: S33887; MUID: 83001943 ; PMID:6811139 
A;Accession: S33887 
A; Molecule type: DNA 
A;Residues: 88-113;235-330 <TAK> 
A; Cross-references : EMBL: Z17370 

R;Cunningham, B.A. ; Rutishauser, U. ; Gall, W.E.; Gottlieb, P.D.; Waxdal, M. J. ; 
Edelman, G.M. 

Biochemistry 9, 3161-3170, 1970 



A;Title: The covalent structure of a human gammaG- immunoglobulin. VII. Amino 

acid sequence of heavy-chain cyanogen bromide fragments H-l-H-4. 

A;Reference number: A90563; MUID: 71064024 ; PMID: 5489771 

A; Contents: myeloma protein Eu 

A;Accession : B90563 

A;Molecule type: protein 

A;Residues: 1-96, 'R' , 98-135 <CUN> 

A;Note: this sequence has the Glm(3) marker, 97-Arg 

R;Rutishauser , U. ; Cunningham, B.A.; Bennett, C. ; Konigsberg, W.H.; Edelman, 
G.M. 

Biochemistry 9, 3171-3181, 1970 

A;Title: The covalent structure of a human gammaG- immunoglobulin. VIII. Amino 

acid sequence of heavy-chain cyanogen bromide fragments H-5-H-7. 

A;Reference number: A90564; MUID : 71064025 ; PMID : 5530842 

A; Contents: Eu 

A /Access ion: A90564 

A /Molecule type: protein 

A; Residues: 136-154, 'Q' , 156-165, 'Q' , 167-176, , Q' , 178-194, ' N 1 , 196-197, ' D ' , 199- 
238, ' E ' , 240 , 'M 1 ,242-267, 1 DGEPE ' ,273-329 <RUT> 

A;Note: this sequence has the Glm(non-l) markers, 239-Glu and 241-Met 
R;Ponstingl, H.; Hilschmann, N. 

Hoppe-Seyler's Z. Physiol. Chem. 357, 1571-1604, 1976 
A; Title: Die Primaerstruktur eines monoklonalen IgGl-Immunglobulins 
(Myelomprotein Nie) , III. Die chymotryptischen Peptide der H-Kette, Anordnung 
der tryptischen Peptide und Diskussion der vollstaendigen Primaerstruktur. 
A/Reference number: A91668; MUID: 77070269; PMID: 826475 
A; Contents: myeloma protein Nie 
A;Accession: B91668 
A;Molecule type: protein 

A; Residues: 1-34, 'Q' ,36-96, 'K' ,98-115, 'Q' ,117-197, 'D' ,199-238, 'D' ,24 0, 'L' ,242- 
268, 'E' ,270-271, 'D' ,273-330 <PON> 

A;Note: this sequence has the Glm(17) and Glm(l) markers 
R;Schmidt, W.E.; Jung, H.D.; Palm, W. ; Hilschmann, N. 
Hoppe-Seyler's Z. Physiol. Chem. 364, 713-747, 1983 

A;Title: Die Primaerstruktur des kristallisierbaren monoklonalen Immunglobulins 
IgGl KOL, I. 

A/Reference number: A91723; MUID: 83289131 ; PMID: 6884994 
A/Contents : myeloma protein KOL; disulfide bonds 
A;Accession: A91723 
A;Molecule type: protein 

A; Residues: 1-96, »R' ,98-197, 'D' ,199-238, 'E' ,240, 'M' ,242-266, f D' ,268-271, 1 D 1 ,273- 
330 <SCH> 

A;Note: this sequence has the Glm{3) and Glm(non-l) markers 
R;Gall, W.E.; Edelman, G.M. 
Biochemistry 9, 3188-3196, 1970 

A; Title: The covalent structure of a human gammaG- immunoglobulin. X. Intrachain 
disulfide bonds. 

A;Reference number: A90565; MUID : 71064027 ; PMID:4923144 
A;Contents: annotation; disulfide bonds 
R;Dreker, L. ; Schwarz, J. ; Reichel, W. ; Hilschmann, N . 
Hoppe-Seyler's Z. Physiol. Chem. 357, 1515-1540, 1976 

A; Title: Rule of antibody structure. The primary structure of monoclonal IgGl 
immunoglobulin {myeloma protein Nie), I: purification and characterization of 
the protein, the L- and H-chains the cyanogenbromide cleavage products, and the 
disulfide bridges. 

A; Reference number: A91667; MUID: 77070267 ; PMID:1002129 
A;Contents: annotation; disulfide bonds 



C; Genetics : 

A; Gene: GDB : IGHG1 

A; Cross-references : GDB: 120085 ; OMIM : 147100 
A ; Map position: 14q32 , 33-14q32 . 33 
A;Introns: 99/1; 114/1; 224/1 

C; Complex: An immunoglobulin heterotetramer subunit consists of two identical 

light (kappa or lambda) and two identical heavy (alpha, delta, epsilon, gamma, 

or mu) chains usually stabilized by interchain disulfide bonds. In some cases, 

such as IgA and IgM, the subunits associate into larger oligomers. 

C;Superfamily: immunoglobulin C region; immunoglobulin homology 

C;Keywords: duplication; glycoprotein; heterotetramer; immunoglobulin 

F ; 2 0 - 8 5 / Doma in : immunoglobulin homology <IM1> 

F; 137-206/Domain: immunoglobulin homology <IM2> 

F; 243-310/ Doma in : immunoglobulin homology <IM3> 

F;27-83, 144-204, 250-308/Disulfide bonds: #status experimental 

F; 103/Disulf ide bonds: interchain (to light chain) #status experimental 

F; 109, 112/Disulf ide bonds: interchain (to heavy chain) #status experimental 

F; 180/Binding site: carbohydrate (Asn) (covalent) #status experimental 

Query Match 99.6%; Score 1233; DB 1; Length 330; 

Best Local Similarity 100.0%; Pred. No. 9.4e-89; 

Matches 227; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

DKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEWCVVVDVSHEDPEVKFNWYVD 61 

IIIIIIIIIIIIIIMMIIIIIMIIIIIIIIIIIIIIIIIIIIIMIIIIIIMIIM 

DKTHTCPPCPAPELLGGPSVFLFPPKPKDTmiSRTPEVTCVVVDVSHEDPEVKFNWYVD 163 
GVEVHNAKTKPREEQYNSTYRWSVLTVLHQDWLNGKEYKCKVSNKALPAP I EKTI SKAK 121 

MIIIIIIIIIIIMIIIIIIIIIIIIIMIIIIIIIIIIMIMIIIMIIIIIIIIII 

GVEVHNAKTKPREEQYNSTYRWS VLTVLHQDWLNGKEYKCKVSNKALPAP I EKTI SKAK 223 
GQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDI AVEWESNGQPENNYKTTPPVLDS 181 

IIIIIIIMIIIIIIIIIIIIIMIIIIIIIIIIIIIMIIIIIIIIMMIIIIIIIII 

GQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDI AVEWESNGQPENNYKTTPPVLDS 283 
DGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 228 

IMIIIIIIIIIMMIIIIIIIIIIIIIIIIIIIIIIIIIIIMII 



RESULT 3 
S69339 

Ig heavy chain V region precursor - human 
C;Species: Homo sapiens (man) 

C;Date: 19-Mar-1997 #sequence_revision 19-Mar-1997 #text_change 01-Dec-2000 
C;Accession: S69339; S72664 

R;Khamlichi, A. A. ; Aucouturier, P.; Preud 1 homme , J.L.; Cogne, M. 
Eur. J. Biochem. 22 9, 54-60 , 1995 

A; Title: Structure of abnormal heavy chains in human heavy- chain-depos it ion 
disease . 

A; Reference number: S69339; MUID : 95262687 ; PMID:7744049 

A; Access ion: S6933 9 

A; Status : preliminary 

A; Molecule type: mRNA 

A;Residues: 1-374 <KHA> 

A; Cross -references : EMBL:X81695 

R;Khamlichi, A. A. 



Qy 


2 


Db 


104 


Qy 


62 


Db 


164 


Qy 


122 


Db 


224 


Qy 


182 


Db 


284 



submitted to the EMBL Data Library, September 1994 

A/Reference number: S72664 

A /Access ion: S72664 

A; Status: preliminary 

A; Molecule type: mRNA 

A;Residues: 1-140, T C , 142-374 <KH2> 

A; Cross-references : EMBL: X8 1695 

C;Superfamily: immunoglobulin C region; immunoglobulin homology 

Query Match 99.1%; Score 1227; DB 2; Length 374; 

Best Local Similarity 99.1%; Pred. No. 3.2e-88; 

Matches 225; Conservative 2; Mismatches 0; Indels 0; Gaps 0; 
QY 2 DKTHTCPPCPAPELLGGPSVFLFPPKPKDTLJviISRTPEvTCWVDVSHEDPEVKFNWYVD 61 

MINI II I II I II Ml III Mill Mill I MM I Mill I MM III Mill II I III 

148 DKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFNWYVD 207 

QY 62 GVEVHNAKTKPREEQYNSTYRWSVLTVLHQDWLNGKEYKCKVSNKALPAPI EKTI SKAK 121 

M I I M I I M I I I II II I I I I II I I II || I M II I I II I II I I II II II II II I I II I I I 
Db 208 GVEVHNAKTKPREEQYNSTYRWSVLTVLHQDWLNGKEYKCKVSNKALPAPI EKTI SKAK 267 

Qy 122 GQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDS 181 

M 1 1 1 1 1 f 1 1 1 1 1 1 1 ^ I = 1 1 1 1 1 1 1 j 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 , 1 1 1 1 1 1 ,,, | ,, , 

Db 2 68 GQPREPQVYTLPPSREEMTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDS 327 

Qy 182 DGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 228 

1 1 ) 1 1 ) r i f r 1 1 1 1 1 r 1 1 1 1 r I j 1 1 1 r 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

Db 328 DGS FFL YS KLTVDKSRWQQGNVFS CS VMHEALHNH YTQKSLSLS PGK 374 



RESULT 4 
PT0207 

Ig gamma chain C region - chimpanzee 
C; Species: Pan troglodytes (chimpanzee) 

C;Date: 23-Nov-1991 #sequence_revision 23-Nov-1991 #text_change 16-Jul-1999 
C;Accession: PT0207 

R;Ehrlich, P.H.; Moustafa, Z.A.; Oestberg, L. 
Mol. Immunol. 28, 319-322, 1991 

A;Title: Nucleotide sequence of chimpanzee Fc and hinge regions. 
A;Reference number: PT0207; MUID: 91287716 ; PMID:2062315 
A; Access ion: PT02 07 
A; Molecule type: mRNA 
A; Residues: 1-234 <EHR> 

C;Superfamily : immunoglobulin C region; immunoglobulin homology 
C ; Keywords : immunoglobul in 

F; 48-117/ Doma in : immunoglobulin homology <IMM> 

Query Match 95.3%; Score 118 0; DB 2; Length 234; 

Best Local Similarity 98.6%; Pred. No. 8.2e-85; 

Matches 217; Conservative 1; Mismatches 2; Indels 0; Gaps 0; 
Qy 2 DKTHTCPPCPAPELLGGPSWLFPPKPKDTLMISRTPEVTC^ 61 

I 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

15 DTTHTCPPCAAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFNWYVD 74 
Qy 62 GVEVHNAKTKPREEQYNSTYRWSVLTVLHQDWLNGKEYKCKVSNKALPAPI EKTI SKAK 121 

IIIMMIMMIIIIIIIIIIIMMIIIIIIIIIIIIIIIIIIIIIIMIIMIIMI 



Db 75 GVEVHNAKTKPREEQYNSTYRWSVLTVLHQDWLNGKEYKCKVSNKALPAPI EKTI SKAK 134 

Qy 122 GQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDS 181 

MINIMI IIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIhllllllllllllllll 

Db 135 GQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESSGQPENNYKTTPPVLDS 194 

Qy 182 DGS FFLYS KLTVDKSRWQQGNVFS CS VMHEALHNH YTQKS 221 

MM M. 'I: III III I I I I III M III 

Db 195 DGS FFLYSKLTVDKSRWQQGNVFSCS VMHEALHNH YTQKS 234 

RESULT 5 
A23511 

Ig gamma-3 chain C region (allotype G3m(b)) - human 
C; Species: Homo sapiens (man) 

C;Date: 28-Dec-1987 #sequence_revision 28-Dec-1987 #text_change 23-Jul-1999 
C;Accession: A23511 

R;Huck, S./ Fort, P.; Crawford, D.H.; Lefranc, M.P.; Lefranc, G. 
Nucleic Acids Res. 14, 1779-1789, 1986 

A; Title : Sequence of a human immunoglobulin gamma 3 heavy chain constant region 

gene: comparison with the other human C-gamma genes. 

A; Reference number: A23511; MUID : 86148507 ; PMID: 3081877 

A;Accession: A23511 

A; Molecule type: DNA 

A /Residues: 1-377 <HUC> 

A; Cross -references: GB:X03604; GB:M12958; NID:g33070; PIDN : CAA27268 . 1 ; 

PID:g577056 

C; Genetics: 

A; Gene: GDB : IGHG3 

A; Cross -references: GDB:119339; OMIM: 147120 
A ; Map position: 14q32 . 33 -14q32 . 33 

A;Introns: 98/3; 115/3; 130/3; 145/3; 160/3; 270/3 

C; Super family: immunoglobulin C region; immunoglobulin homology 

C; Keywords: immunoglobulin 

F; 20-85/Domain: immunoglobulin homology <IMM> 

Query Match 92.6%; Score 1146; DB 2; Length 377; 

Best Local Similarity 92.5%; Pred. No. 6.7e-82; 

Matches 210; Conservative 8; Mismatches 9; Indels 0; Gaps 0; 
Qy 2 DKTHTCPPCPAPELIX3GPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFNWYVD 61 

I M IMIIMIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIhl 1 1 1 1 

Db 151 DTPPPCPRCPAPELLGGPSVFLFPPKPKDTLMISRTPEvTCXAAHJVSHEDPEVQFKWYVD 210 

Qy 62 GVEVHNAKTKPREEQYNSTYRWSVLTVLHQDWLNGKEYKCKVSNKALPAPI EKTI SKAK 121 

I I I I M I I I I I I I I I I I I I : | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | 
Db 211 GVEVHNAKTKPREEQYNSTFRWSVLTVLHQDWLNGKEYKCKVSNKALPAPI EKTI SKTK 270 

QY 122 GQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDS 181 

I H I I II I I I I I I I I : I : | | | M I I I I I II II I I I I I II I I I I : I I I I I I I lllhlll 
D b 271 GQPREPQVYTLPPSREEMTKNQVSLTCLVKGFYPSDIAVEWESSGQPENNYNTTPPMLDS 33 0 

Qy 182 DGS FFLYS KLTVDKSRWQQGNVFS CS VMHEALHNH YTQKSLSLS PGK 228 

IIIIIIIIIMIIIIIIIIIhllllllllMII :|||||||||ll 
Db 331 DGSFFLYSKLTVDKSRWQQGNIFSCSVMHEALHNRFTQKSLSLSPGK 377 



RESULT 6 
A60764 

Ig gamma-3 chain C region, form LAT - human 
C; Species: Homo sapiens (man) 

C;Date: 14-May-1993 #sequence_revision 14-May-1993 #text_change 16-Jul-1999 
C; Accession: A60764 

R;Huck, S.; Lefranc, G. ; Lefranc, M.P. 
Immunogenetics 30, 250-257, 1989 

A; Title: A human immunoglobulin IGHG3 allele (GmbO, bl, c3 , c5, u) with an IGHG4 
converted region and three hinge exons . 

A/Reference number: A60764; MUID : 90007613 ; PMID: 2571587 
A;Accession: A60764 
A; Status: preliminary 
A; Molecule type: DNA 
A; Residues: 1-377 <HUC> 

C;Superfamily: immunoglobulin C region; immunoglobulin homology 
C ; Keywords : immunoglobul in 

F;20-85/Domain: immunoglobulin homology <IMM> 

Query Match 92.4%; Score 1144; DB 2; Length 377; 

Best Local Similarity 92.5%; Pred. No. 9.6e-82; 

Matches 210; Conservative 8; Mismatches 9; Indels 0; Gaps 0; 

Qy 2 DKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFNWYVD 61 

I M I M I I II I II I I I I I I I I I I | I | | | | | M II I M I I I I I I I I |: | | | M 

Db 151 DTPPPCPRCPAPELLGGPSVFLFPPKPKDTLMI SRTPEVTCVWDVSHEDPEVQFKWYVD 210 

QY 62 GVEVHNAKTKPREEQYNSTYRWSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKAK 121 

I II I II Ml II I III II Ihlll MINI IIIMM I II III II III MINIM III I 

Db 211 GVEVHNAKTKPREEQYNSTFRWSVLTVLHQDWLNGKEYKCKVSNKALPAPI EKTI SKTK 270 

Qy 122 GQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDS 181 

I M I I I M I I I I I I I : I : I I I I I I I | | | | I I I I I I I I I I I II I : I I I I I I I I I I I I I I I 
Db 271 GQPREPQVYTLPPSREEMTKNQVSLTCLVKGFYPSDIAVEWESSGQPENNYNTTPPVLDS 33 0 

Qy 182 DGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 228 

M I M lh II I lllllhl llllll INI I II I :MIIIIIIIII 
Db 331 DGSFFLYSRLTVDKSRWQEGNVFSCSVMHEALHNRFTQKSLSLSPGK 377 



RESULT 7 
G2HU 

Ig gamma-2 chain C region - human 
C;Species: Homo sapiens (man) 

C;Date: 30-Apr-1981 #sequence_revision 13-Jun-1983 #text_change 21-Jul-2000 
C;Accession: A93906; A92809; A90752; A93132; A02148 
R;Ellison, J. ; Hood, L. 

Proc. Natl. Acad. Sci. U.S. A, 79, 1984-1988, 1982 

A; Title : Linkage and sequence homology of two human immunoglobulin gamma heavy 
chain constant region genes. 

A;Reference number: A93906; MUID: 82197621 ; PMID:6804948 
A; Accession: A93906 
A; Molecule type: DNA 
A;Residues: 1-326 <ELL> 

A; Cross -references: GB:V00554; GB:J00230; NID:g32759; PIDN : CAB58438 . 1 ; 
PID:g6066056 

A;Note: Lys-326 is probably removed posttranslationally 



R;Wang, A.C.; Tung, E. ; Fudenberg, H.H. 
J. Immunol. 125, 1048-1054, 1980 

A; Title: The primary structure of a human IgG2 heavy chain: genetic, 

evolutionary, and functional implications. 

A;Reference number: A92809; MUID: 81007873 ; PMID: 6774012 

A/Contents : myeloma protein Til 

A;Accession: A92809 

A;Molecule type: protein 

A;ResidueS: 1-19 ,' Q ', 21-57 , ' Z ', 59 , 'A ', 61-193 , 1 D ', 195-325 <WAN> 
A;Note: Trp-156 is at or near the complement -binding site 
R;Connell, G.E.; Parr, D.M. ; Hofmann, T. 
Can. J. Biochem. 57, 758-767, 1979 

A; Title: The amino acid sequences of the three heavy chain constant region 

domains of a human IgG2 myeloma protein. 

A;Reference number: A90752; MUID : 80001357 ; PMID: 113060 

A; Contents: myeloma protein Zie 

A;AccesSion: A90752 

A;Molecule type: protein 

A; Residues: 1-24, 'E 1 ,26-57, 1 EV 1 ,60-85/132-171, ' ZZZ • ,175, 'B' ,177-193, f D' ,195- 

196, 'Q' ,198-234, 'Z' ,236-263, ' BGEPZ 1 ,269-325 <CON> 

A; Note: this sequence has since been revised 

R; Hofmann, T. ; Parr, D.M. 

Mol. Immunol. 16, 923-925, 1979 

A;Title: A note on the amino acid sequence of residues 381-391 of human 
immunoglobulin gamma chains. 

A;Reference number: A93132; MUID: 80114419; PMID: 118920 

A; Contents: Zie 

A; Access ion: A93132 

A; Molecule type: protein 

A;Residues: 238-275 <HOF> 

R,-Hofmann, T. ; Parr, D.M. 

submitted to the Atlas, March 1980 

A; Reference number: A94591 

A; Contents: annotation; Zie, revisions to residues 25, 59, 60, and 264-268 

A; Note: the revised sequence differs from that shown in having 60-Ala and in the 

amidation states of residues 58, 194, and 197; the amidation states of residues 

172-174, 176, and 235 were not determined 

R;Milstein, C; Frangione, B. 

Biochem. J. 121, 217-225, 1971 

A ; Title: Disulphide bridges of the heavy chain of human immunoglobulin G2 . 
A;Reference number: A90253; MUID : 72033500 ; PMID:4940472 
A;Contents: annotation; myeloma protein Sa, disulfide bonds 
R;Frangione, B.; Milstein, C. ; Pink, J.R.L. 
Nature 221, 145-148, 1969 

A;Title: Structural studies of immunoglobulin G. 

A;Reference number: A93157; MUID: 69064124 ; PMID:5782707 

A;Contents: annotation; Sa, disulfide bonds 

C; Genetics : 

A; Gene: GDB : IGHG2 

A; Cross -references: GDB:119338; OMIM: 147110 
A ; Map position: 14q32 . 33-14q32 . 33 

C; Complex: An immunoglobulin heterotetramer subunit consists of two identical 
light (kappa or lambda) and two identical heavy (alpha, delta, epsilon, gamma, 
or mu) chains usually stabilized by interchain disulfide bonds. In some cases, 
such as IgA and IgM, the subunits associate into larger oligomers. 
C;Superfamily: immunoglobulin C region; immunoglobulin homology 
C;Keywords: duplication; glycoprotein; heterotetramer; immunoglobulin 



F;20-85/Domain: immunoglobulin homology <IM1> 
F; 133 -2 02 /Domain : immunoglobulin homology <IM2> 
F;239-306/Domain: immunoglobulin homology <IM3> 

F; 14/Disulf ide bonds: interchain (to light chain) #status experimental 

F;27-83, 140-200, 246-304/Disulf ide bonds: #status experimental 

F; 102, 103, 106, 109/Disulf ide bonds: interchain (to heavy chain) #status 

experimental 

F; 176/Binding site: carbohydrate (Asn) (covalent) #status predicted 



Query Match 92.3%; Score 1142.5; DB 1; Length 326; 

Best Local Similarity 94.1%; Pred. No. le-81; 

Matches 209; Conservative 8; Mismatches 4; Indels 1; Gaps 1; 



Qy 
Db 


7 

106 


CPPCPAPELLGGPSVFLFPPKPKDTKVIISRTPEvTCVVVDVSHEDPEVKFNWYVDGVEVH 

Illllll : IIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIhlllllllllll 

CPPCPAPP-VAGPSVFLFPPKPKDTLMISRTPEWCvWDVSHEDPEVQFNWYVDGVEVH 


66 
164 


Qy 
Db 


67 
165 


NAKTKPREEQYNSTYRWSVLTVLHQDWLNGKEYKCKVSNKALPAPI EKTI SKAKGQPRE 

MlllllllhllhlllllllhlllllMIIIIIIIIM IIIIIIIIIII MINI 

NAKTKPREEQFNSTFRWSVLTWHQDWLNGKEYKCKVSNKGLPAPI EKTI SKTKGQPRE 


126 
224 


Qy 


127 


PQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFF 

MlllllllhhIIIIMIIIMIIMIIIIIMIIIIIIIIIIIIIMhIMIIIII 

PQVYTLPPSREEMTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPMLDSDGSFF 


186 


Db 


225 


284 


Qy 


- 187 


LYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 228 

i i iii i i i 1 ! i : .I... 

LYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 326 




Db 


285 





RESULT 8 
G4HU 

Ig gamma -4 chain C region - human 
C; Species: Homo sapiens (man) 

C;Date: 02-Apr-1982 #sequence_revision 02-Apr-1982 #text__change 16-Jul-1999 
C;Accession: A90933; A90249; A02150 
R;Ellison, J.; Buxbaum, J.; Hood, L. 
DNA 1, 11-18, 1981 

A; Title: Nucleotide sequence of a human immunoglobulin C-gamma4 gene. 
A;Reference number: A90933; MUID: 83157104 ; PMID:6299662 
A;Accession: A90933 
A;Molecule type: DNA 
A;Residues: 1-327 <ELL> 

A; Note: the sequence was determined from the germline gene 
R;Pink, J.R.L.; Buttery, S.H.; DeVries, G.M.; Milstein, C. 
Biochem. J. 117, 33-47, 1970 

A;Title: Human immunoglobulin sublclasses. Partial amino acid sequence of the 
constant region of A gamma4 chain. 

A;Reference number: A90249; MUID: 70207560; PMID:4192699 

A;Accession: A90249 

A;Molecule type: protein 

A;Residues: l-30;81-326 <PIN> 

C;Genetics : 

A; Gene: GDB : IGHG4 

A; Cross -references : GDB: 119340; OMIM: 147130 
A ; Map position: 14q32 . 33 -14q32 . 33 
A;IntronS: 99/1; 111/1; 221/1 



C; Complex: An immunoglobulin heterotetramer subunit consists of two identical 
light (kappa or lambda) and two identical heavy (alpha, delta, epsilon, gamma, 
or mu) chains usually stabilized by interchain disulfide bonds. In some cases, 
such as IgA and IgM, the subunits associate into larger oligomers. 
C;Superfamily : immunoglobulin C region; immunoglobulin homology 
C; Keywords : duplication; glycoprotein; heterotetramer; immunoglobulin 
F; 20-85/Domain: immunoglobulin homology <IM1> 
F; 99-110/Region: hinge 

F; 134 -203/Domain : immunoglobulin homology <IM2> 
F;240-307/Domain: immunoglobulin homology <IM3> 

F; 14/Disulf ide bonds: interchain (to light chain) #status experimental 
F;27-83, 141-201, 247-305/Disulfide bonds: #status predicted 

F;106,109/Disulfide bonds: interchain (to heavy chain) #status experimental 
F; 177/Binding site: carbohydrate (Asn) (covalent) #status predicted 

Query Match 91.7%; Score 1135; DB 1; Length 327; 

Best Local Similarity 93.7%; Pred. No. 4e-81; 

Matches 208; Conservative 8; Mismatches 6; Indels 0; Gaps 0; 

CPPCPAPELLGGPSVFLFPPKPKDTLMI SRTPE VTCVWDVSHEDPEVKFNWYVDGVE VH 6 6 

II I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I llllhlllllllllll 
CPSCPAPEFLGGPSVFLFPPKPKDTLMI SRTPEVTCVWDVSQEDPEVQFNWYVDGVEVH 165 

NAKTKPREEQYNSTYRWSVLTVLHQDWLNGKEYKCKVSNKALPAPI EKTI SKAKGQPRE 126 

MM I II 1 1 hi I llllllllllllll llllllllllllll Ih II I II Mill 1 1 

NAKTKPREEQFNSTYRWSVLTVLHQDWLNGKEYKCKVSNKGLPSSI EKTI SKAKGQPRE 225 
PQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFF 18 6 

i 1 I 1 1 1 1 ^ ! i : j I I I i i : I 111 1 1 1 . 1 1 1 I . I i I 1 , 1 1 1 1 ^ 

PQVYTLPPSQEEMTKNQVSLTCLVKGFYPSDI AVEWESNGQPENNYKTTPPVLDSDGSFF 285 
LYSKLTVDKSRWQQGNVFS CS VMHEALHNHYTQKSLSLSPGK 228 

I I I = 1 1 1 1 1 1 1 1 1 = M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 E 1 1 1 1 1 II 



Qy 


7 


Db 


106 


Qy 


67 


Db 


166 


Qy 


127 


Db 


226 


Qy 


187 


Db 


286 



RESULT 9 
G3HUWI 

Ig gamma -3 heavy chain disease proteins - human 
C; Species: Homo sapiens (man) 

C;Date: 31-Dec-1979 #sequence_revision 23-Oct-1981 #text_change 16-Jul-1999 
C;Accession: A90442; A92219; A90198; A93915; A02149 
R,-Frangione, B.; Rosenwasser, E . ; Prelli, .F. ; Franklin, E.C. 
Biochemistry 19, 4304-4308, 1980 

A;Title: Primary structure of human gamma3 immunoglobulin deletion mutant: 
gamma3 heavy-chain disease protein Wis. 

A;Reference number: A90442; MUID: 81021548 ; PMID: 6774747 

A; Contents: heavy chain disease protein Wis 

A;Accession: A90442 

A; Molecule type: protein 

A; Residues: 1-289 <FRA> 

A;Note: the molecule is a dimer linked by 12 disulfide bonds; it has an extra 
interchain disulfide bond at position 7 in addition to the 11 normally present 
in the hinge region 

A;Note: this protein lacks most of the V region and all of the CHI region. 
Residue 12 corresponds to the beginning of the hinge region of normal gamma-3 
chains 



A;Note: the sequence of residues 42-76 was taken from the reference that follows 
R;Michaelsen, T.E.; Frangione, B.; Franklin, E.C. 
J. Biol. Chem. 252, 883-889, 1977 

A;Title: Primary structure of the 'hinge 1 region of human IgG3 . Probable 
quadruplication of a 15-amino acid residue basic unit. 
A;Reference number: A92219; MUID: 77118561; PMID:402363 

A;Contents: normal gamma-3 chains, sequence corresponding to residues 12-97 of 
protein Wis 
A;Accession: A92219 
A;Molecule type: protein 
A;Residues: 12-97 <MIC> 

A;Note: the hinge region in gamma-3 chains is about four times as long as in 
other gamma chains and contains three identical 15-residue segments (29-43, 44- 
58, and 59-73) preceded by a similar 17-residue segment (12-28) 
A;Note: cysteines at positions 24, 27, 33, 39, 42, 48, 54, 57, 63, 69, and 72 
form inter-heavy chain bonds 

R;Wolfenstein-Todel, C. ; Frangione, B. ; Prelli, F. ; Franklin, E.C. 
Biochem. Biophys . Res. Commun. 71, 907-914, 1976 

A; Title: The amino acid sequence of "heavy chain disease" protein ZUC. Structure 

of the Fc fragment of immunoglobulin G3 . 

A;Reference number: A90198; MUID: 77021516; PMID: 823945 

A;Contents: heavy chain disease protein Zuc, partial sequence corresponding to 

residues 59-289 of protein Wis 

A;Accession: A90198 

A; Molecule type: protein 

A;Residues: 59-125 , ' EB », 128 -226 , 228-289 <WOL> 

A;Note: this protein lacks most of the V region, all of the CHI region, and part 
of the hinge compared with normal gamma-3 heavy chains 

R;Alexander, A. ; Steinmetz, M. ; Barritault, D. ; Frangione, B. ; Franklin, E.C; 
Hood , L . ; Buxbaum , J.N. 

Proc. Natl. Acad. Sci. U.S.A. 79, 3260-3264, 1982 

A; Title: gamma heavy chain disease in man: cDNA sequence supports partial gene 
deletion model. 

A;Reference number: A93915; MUID : 82247835 ; PMID:6808505 
A; Contents: heavy chain disease protein Omm 
A;Accession: A93915 
A; Molecule type: mRNA 

A; Residues: 12-70; 72-114 ; 116-125 , 'E» , 127-133, 1 L ' , 135-136, 1 E ' , 138 , ' Y 1 ,140- 

154, f D' ,156-157, 1 N ' , 159-181 , ' A 1 , 183-186 ; 199-226 , ' N 1 ,228-278, 1 Y 1 ,280-289 <ALE> 

A; Note: a carboxyl -terminal Lys is removed posttranslationally 

A; Note: this sequence may represent an allelic form or another gamma chain 

subclass 

C;Comment: The heavy chain disease protein Wis is shown. 

C;Genetics : 

A; Gene: GDB : I GHG3 

A; Cross -references : GDB: 119339; OMIM: 147120 
A ; Map position: 14q32 . 33 -14q32 . 33 

C;Superfamily: immunoglobulin C region; immunoglobulin homology 

C; Keywords: duplication; glycoprotein; immunoglobulin; pyroglutamic acid 

F;203-270/Domain: immunoglobulin homology <IMM> 

F;l/Modified site: pyrrolidone carboxylic acid (Gin) #status experimental 
F; 6, 140/Binding site: carbohydrate (Asn) (covalent) #status experimental 

Query Match 90.5%; Score 1121; DB 1; Length 289; 

Best Local Similarity 90.3%; Pred. No. 4.2e-80; 

Matches 204; Conservative 13; Mismatches 9; Indels 0; Gaps 0; 



Qy 


2 


DKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVWDVSHEDPEVKFNWYVD 


61 






1 II Ml M 1 Ml 1 1 1 III 1 1 1 UN MINIUM M 




Db 




DTPPPCPRCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVWDVSHEDPEVQFKWYVD 


123 


Qy 


62 


GVEVHNAKTKPREEQYNSTYRWSVLTVLHQDWLNGKEYKCKVSNKALPAPI EKTI SKAK 


121 






1 1 ll 1 1 Mill M IMIM , MM . 1 II Ml | 




LJXJ 


X ^ rt 


GVQVHNAKTKPREQQFNSTFRWS VLTVLHQNWLDGKE YKCKVSNKALPAP I EKTI SKTK 


183 


Qy 


122 


GQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDS 18 1 






M ■ 1 1 M I; Ml M 1 1 IIMMIMM 1 1 1 1 = 1 1 1 




Db 


184 


GQPREPQVYTLPPSREEMTKNQVSLTCLVKGFYPSDIAVEWESSGQPENNYNTTPPMLDS 


243 


Qy 


182 


DGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPG 227 








MMMI 1 Ml Ml 1 MMIMM III III 1 Ml 




Db 


244 


DGSFFLYSKLTVDKSRWQQGNI FSCS VMHEALHNRFTQKSLSLS PG 28 9 





RESULT 10 
GHRB 

Ig gamma chain C region - rabbit 

C; Species: Oryctolagus cuniculus {domestic rabbit) 

C;Date: 24-Apr-1984 #sequence_revision 15-Nov-1984 #text_change 16-Jul-1999 
C;Accession: A91749; A90290; A93928; A90245; A94416; A02161 
R;Bernstein, K.E.; Alexander, C.B.; Mage, R.G. 
Immunogenetics 18, 387-397, 1983 

A;Title: Nucleotide sequence of a rabbit IgG heavy chain from the recombinant F- 
I haplotype. 

A; Reference number: A91749; MUID: 84030930; PMID:6313520 
A /Access ion: A9174 9 
A /Molecule type: mRNA 
A; Residues: 1-323 <BER> 

A; Note: this sequence has the dl2 allotypic marker, 104 -Thr, and the el4 marker, 
185-Thr 

R; Pratt, D.M.; Mole, L.E. 
Biochem. J . 151, 337-349, 1975 

A; Title: Sequence studies on the constant region of the Fd sections of rabbit 
immunoglobulin G of different allotype. 

A; Reference number: A90290; MUID: 76135469; PMID:1243651 

A/Accession: A90290 

A; Molecule type: protein 

A;ResidueS: 1-47 E ' , 49-71 ,' PV f , 72-128 <PRA> 

R;Martens, C.L.; Moore, K.W. ; Steinmetz, M. ; Hood, L. ; Knight, K.L. 
Proc. Natl. Acad. Sci. U.S.A. 79, 6018-6022, 1982 

A; Title: Heavy chain genes of rabbit IgG; isolation of a cDNA encoding gamma 
heavy chain and identification of two genomic C-gamma genes. 
A;Reference number: A93928; MUID: 83299917 ; PMID: 6193512 
A;Accession: A93928 
A /Molecule type: mRNA 

A; Residues: 88 -103 , »M 105-143 , ' E ',145-184 , 'A 1 , 186 , 1 E 1 , 188-266 <MAR> 
A; Cross -references: GB:M16426; NID:gl65111; PIDN: AAA31289 . 1; PID:gl65112 
A;Note: this sequence has the dll allotypic marker, 104 -Met, and the el5 
allotypic marker, 185-Ala 

R,-Fruchter, R.G.; Jackson, S.A.; Mole, L.E.,- Porter, R.R. 
Biochem. J. 116, 249-259, 1970 

A; Title: Sequence studies of the Fd section of the heavy chain of rabbit 
immunoglobulin G. 

A; Reference number: A90245; MUID: 70110015; PMID:5461106 



A; Access ion: A90245 

A;Molecule type: protein 

A;Residues: 132-143 ,' E ', 145-161 <FRU> 

R;Hill, R.L.; Lebovitz,. H.E.; Fellows Jr., R.E.; Delaney, R. 

in Gamma Globulins, Nobel Symp. 3, Killander, J., ed. , pp. 109-127, Almqvist and 

Wiksell, Stockholm, 1967 

A; Reference number: A94416 

A/Accession: A94416 

A; Molecule type: protein 

A; Residues: 12 9-131 ; 155-172 , 'D» ,174-184, ! A' ,186, 'E' ,188-200, f D ! ,202-217, 'E' ,219- 
232, »Q' ,234-245, 'D' ,247-255, 'G' ,257-259, 1 D ' , 261-265 , ' D 1 ,267-279, 'W ,281- 
283, 'S 1 ,285-322 <HIL> 

A;Note: this has the el5 allotypic marker, 185-Ala 

C;Complex: An immunoglobulin heterotetramer subunit consists of two. identical 

light (kappa or lambda) and two identical heavy (alpha, delta, epsilon, gamma, 

or mu) chains usually stabilized by interchain disulfide bonds. In some cases, 

such as IgA and IgM, the subunits associate into larger oligomers. 

C; Super family: immunoglobulin C region; immunoglobulin homology 

C; Keywords : duplication; glycoprotein; heterotetramer; immunoglobulin 

F; 2 0-82/Domain: immunoglobulin homology <IM1> 

F; 130-199/Domain: immunoglobulin homology <IM2> 

F;236-303/Domain: immunoglobulin homology <IM3> 

F; 173/Binding site: carbohydrate (Asn) (covalent) #status predicted 

Query Match 74.2%; Score 918,5; DB 1; Length 323; 

Best Local Similarity 71.7%; Pred. No. 2.9e-64; 

Matches 167; Conservative 29; Mismatches 32; Indels 5; Gaps 2; 

MDKT HTC- - PPCPAPELLGGPSVFLFPPKPKDTLMI SRTPEVTCVWDVSHEDPEVK 55 

MM II I II ' I I II I M I M I I II I I I I I . |[ I I I I I I I I I I :||lh 

VDKTVAPSTCSKPTCPPPELLGGPSVFI FPPKPKDTLMI SRTPEVTCVWDVSQDDPEVQ 150 

FNW YVDGVEVHNAKTKPREEQYNSTYRWS VLTVLHQDWLNGKE YKCKVSNKALPAP I EK 115 

I I h IhhMI | | | | | : HIM IM = |||| llllllllll 

FTWYI NNEQVRTARPPLREQQFNSTI R WSTLP I THQDWLRGKEFKCKVHNKALPAP I EK 210 

TISKAKGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTT 175 

Ml: ■ Ihllh I hlh llllh: llllllhllll Ih hlllll 
TI SKARGQPLEPKVYTMGPPREELSSRSVSLTCMINGFYPSDI S VEWEKNGKAEDNYKTT 270 

P P VLDSDGS FFLYS KLTVDKSRWQQGN VF S CS VMHEALHNHYTQKS LSLS PGK 228 

I MMIIhllhlhl I Ihhihillllllllllllllhl MM 



Qy 


1 


Db 


91 


Qy 


56 


Db 


151 


Qy 


116 


Db 


211 


Qy 


176 


Db 


271 



RESULT 11 
147160 

Ig gamma 2b chain constant region - pig (fragment) 
C; Species: Sus scrofa domestica (domestic pig) 

C;Date: 21-Feb-1997 #sequence_revision 21-Feb-1997 #text_change 21-Jan-2000 
C; Access ion: 147160 

R;Kacskovics, I.; Sun, J.; Butler, J.E. 
J. Immunol. 153, 3565-3573, 1994 

A; Title: Five putative subclasses of swine IgG identified from the cDNA 
sequences of a single animal. 

A; Reference number: 147158; MUID: 95015845 ; PMID:7930579 
A; Access ion : 147160 



A;Status: preliminary; translated from GB/EMBL/DDBJ 
A /Molecule type: mRNA 
A; Residues : 1-328 <KAC> 

A; Cross-references: EMBL:U03780; NID:g433125; PIDN : AAA52218 . 1 ; PID:g433126 
C; Genetics: 
A; Gene: IgG2b 

C;Superfamily : immunoglobulin C region; immunoglobulin homology 
F; 133 -202/Domain : immunoglobulin homology <IMM> 



Query Match 73.2%; Score 906.5; DB 2; Length 328; 

Best Local Similarity 73.2%; Pred. No. 2.6e-63; 

Matches 164; Conservative 29; Mismatches 28; Indels 3; Gaps 2; 
Qy 7 CPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVWDVSHEDPEVKFNWYVDGVEVH 66 

II III I Ml ■ I M'MMII ,1 I III ; 'l hill I'll 

Db 106 CPICPACE-SPGPSVFIFPPKPKDTMISRTPQWCVVVDVSQENPEVQFSWYVDGVEVH 164 

QY 67 NAKTKPREEQYNSTYRWSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKAKGQPRE 126 

I i I i I I ! : Illlllllhlllhll HIM : IMIIII II 

Db 165 TAQTRPKEEQFNSTYRWSVljPIQHQDWLNGKEFKCKVNNKDLPAPITRIISKAKGQTRE 224 

Qy 127 PQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQ- - PENNYKTTPPVLDSDGS 184 

IMIMII :||::::||:|||| |||| II Mh III I I Ihllll I I 
Db 225 PQVYTLPPHAEELSRSKVSITCLVIGFYPPDIDVEWQRNGQPEPEGNYRTTPPQQDVDGT 284 

Qy 185 FFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 228 

MINI :||h II I H hlllllllllllllhl Mil 
Db 28 5 YFLYSKFSVDKASWQGGGI FQCAVMHEALHNHYTQKS I SKTPGK 328 



RESULT 12 
147159 

Ig gamma 2a chain constant region - pig (fragment) 
C; Species: Sus scrofa domestica (domestic pig) 

C;Date: 21-Feb-1997 #sequence_revision 21-Feb-1997 #text_change 21-Jan-2000 
C; Access ion: 14715 9 

R;Kacskovics, I.; Sun, J. ; Butler, J.E. 
J. Immunol. 153, 3565-3573, 1994 

A; Title: Five putative subclasses of swine IgG identified from the cDNA 
sequences of a single animal. 

A; Reference number: 147158; MUID : 95015845 ; PMID:7930579 
A; Access ion: 147159 

A;Status: preliminary; translated from GB/EMBL/DDBJ 
A; Molecule type: mRNA 
A; Residues: 1-328 <KAC> 

A; Cross -references: EMBL:U03779; NID:g433123; PIDN : AAA52217 . 1 ; PID:g433124 
C; Genetics : 
A; Gene: IgG2a 

C;Superfamily: immunoglobulin C region; immunoglobulin homology 
F; 133-202/Domain: immunoglobulin homology <IMM> 

Query Match 73.2%; Score 906.5; DB 2; Length 328; 

Best Local Similarity 73.2%; Pred. No. 2.6e-63; 

Matches 164; Conservative 29; Mismatches 28; Indels 3; Gaps 2; 
Qy 7 CPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFNWYVDGVEVH 66 

M Ml I 1 l.l l 1 1 1 1 1 ■ I ' 1 , 1 1 ^ 1 1 1 1 1 ' 1 1 hllhhlllllllll 



Db 106 CPICPACE-SPGPSVFIFPPKPKDTLMISRTPQVTCVWDVSQENPEVQFSWYVDGVEVH 164 

Qy 67 NAKTKPREEQYNSTYRWSVLTVLHQDWLNGKEYKCKVSNKALPAPI EKTI SKAKGQPRE 126 

hhhllhllMIMMI : IIIIIMIhlllhll HIM : MMIII II 
Db 165 TAQTRPKEEQFNSTYRWSVLPIQHQDWLNGKEFKCKVNNKDLPAPITRI I SKAKGQTRE 224 

Qy 127 PQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQ--PENNYKTTPPVLDSDGS 184 

IMIIII :||::::||:|||| |||| || |||: ||| || ||:|||| I lh 
Db 225 PQVYTLPPHAEELSRSKVSITCLVIGFYPPDIDVEWQRNGQPEPEGNYRTTPPQQDVDGT 284 

Qy 185 FFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 228 

: I I I I I Ulh II I :| I = I I I I I I I I I I I I I I H II 
Db 28 5 YFLYSKFSVDKASWQGGG I FQCAVMHEALHNHYTQKS I S KTPGK 328 

RESULT 13 
147162 

Ig gamma 4 chain constant region - pig (fragment) 
C;Species: Sus scrofa domestica (domestic pig) 

C;Date: 21-Feb-1997 #sequence_revision 21-Feb-1997 #text_change 21-Jan-2000 
C /Accession : 147162 

R; Kacskovics , I.; Sun, J.; Butler, J.E. 
J. Immunol. 153, 3565-3573, 1994 

A; Title: Five putative subclasses of swine IgG identified from the cDNA 
sequences of a single animal. 

A; Reference number: 147158; MUID : 95015845 ; PMID : 793 0579 
A;Accession: 147162 

A;Status: preliminary; translated from GB/EMBL/DDBJ 
A; Molecule type: mRNA 
A; Residues: 1-277 <KAC> 

A; Cross-references : EMBL: U03782 ; NID:g433129; PIDN: AAA52220 . 1 ; PID:g433130 
C; Genetics : 
A; Gene: IgG4 

C;Superfamily: immunoglobulin C region; immunoglobulin homology 
F; 82-151/ Doma in : immunoglobulin homology <IMM> 

Query Match 73.0%; Score 903.5; DB 2; Length 277; 

Best Local Similarity 71.1%; Pred. No. 3.6e-63; 

Matches 167; Conservative 30; Mismatches 31; Indels 7; Gaps 4; 

Qy 1 MDK THTCPPCP-APELLG-GPSVFLFPPKPKDTLMISRTPEVTCVWDVSHEDPEVK 55 

: I I I I I I I I I I III -I I I I I I ( I I I I I I I h I I I I I I I I h||h 

Db 43 VDKRVGTKTKPPCPICPACEGPGPSAFIFPPKPKDTLMISRTPKVTCVWDVSQENPEVQ 102 

Qy 56 FNWYVDGVEVHNAKTKPREEQYNSTYRWSVLTVLHQDWLNGKEYKCKVSNKALPAPIEK 115 

■Mil hhhllhllllllllll : IIIIIMIhlllhll Mill : 
Db 103 FSWY VDGVE VHTAQTRPKEEQFNSTYRWS VLP I QHQDWLNGKEFKCKVNNKDLPAP I TR 162 

Qy 116 TISKAKGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQ- -PENNYK 173 

II III' :||::::h||||| |||| || |||: ||| || ||: 
Db 163 I I SKAKGQTRE PQVYTLPPPTEELSRSKVTLTCLVTGFYPPDIDVEWQRNGQPEPEGNYR 222 

Qy 174 TTPPVLDSDGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 228 

MM I IhMMIII Mh I I : I : I h 1 1 1 1 1 1 1 1 1 1 1 1 1 h MM 

Db 223 TTPPQQDVBGTYFLYSKLAVDKASWQRGDTFQCAVMHEALPINHYTQKSIFKTPGK 277 



RESULT 14 
G2GP 

Ig gamma -2 chain C region - guinea pig 
C; Species: Cavia porcellus (guinea pig) 

C;Date: 07-May-1981 #sequence_revision 07-May-1981 #text_change 16-Jul-1999 

C;Accession: A94553; A90352; A90359; A90384; A90385; A02151 

R ; Tr i s chmann , T . M . 

submitted to the Atlas, April 1975 

A; Reference number: A94553 

A; Accession: A94553 

A;Molecule type: protein 

A; Residues: 1-3 <TRI> 

R;Birshtein, B.K.; Hussain, Q.Z.; Cebra, J.J. 
Biochemistry 10, 18-25, 1971 

A;Title: Structure of heavy chain from strain 13 guinea pig immunoglobulin-G (2) . 
III. Amino acid sequence of the region around the half-cystine joining heavy and 
light chains. 

A;Reference number: A90352; MUID: 71058471 ; PMID:5538606 

A/Accession: A90352 

A; Molecule type: protein 

A/Residues : 4-68 <BIR> 

R;Turner, K.J.; Cebra, J.J. 

Biochemistry 10, 9-17, 1971 

A;Title: Structure of heavy chain from strain 13 guinea pig immunoglobulin-G (2 ) . 
II. Amino acid sequence of the carboxyl -terminal and hinge region cyanogen 
bromide fragments . 

A/Reference number: A90359; MUID: 71058486 ; PMID:5538616 

A;Accession: A90359 

A; Molecule type: protein 

A/Residues : 69-133 ,-312-329 <TUR> 

R;Tracey, D.E.; Cebra, J.J. 

Biochemistry 13, 4796-4803, 1974 

A; Title: Primary structure of the C-H2 homology region from guinea pig IgG2 
antibodies . 

A;Reference number: A90384; MUID: 75036072 ; PMID:4429665 

A; Accession: A90384 

A; Molecule type: protein 

A; Residues: 134-226 <TRA> 

R;Tr is chmann, T.M.; Cebra, J.J. 

Biochemistry 13, 4804-4811, 1974 

A; Title: Primary structure of the C-H3 homology region from guinea pig IgG2 
antibodies . 

A;Reference number: A90385; MUID : 75036073 ; PMID:4609467 

A;Accession: A90385 

A;Molecule type: protein 

A;Residues: 227-311 <TR2> 

R;01iveira, B . ; Lamm, M.E. 

Biochemistry 10, 26-31, 1971 

A;Title: Interchain disulfide bridges of guinea pig gamma-2- immunoglobulin. 

A;Reference number: A90354; MUID : 71058474 ; PMID: 4922544 

A;Contents: annotation; disulfide bonds 

A; Note: Cys-16 i s involved in a heavy- light chain bond 

A;Note: Cys-105, Cys-107, and Cys-110 form inter-heavy chain bonds 

C;Comment: This chain was isolated from pooled serum of strain 13 inbred guinea 

pigs. 

C;Complex: An immunoglobulin heterotetramer subunit consists of two identical 
light (kappa or lambda) and two identical heavy (alpha, delta, epsilon, gamma, 



or mu) chains usually stabilized by interchain disulfide bonds. In some cases, 

such as IgA and IgM, the subunits associate into larger oligomers. 

C;Superfamily: immunoglobulin C region; immunoglobulin homology 

C; Keywords : duplication; glycoprotein; heterotetramer ; immunoglobulin 

F ; 2 1 - 8 1 / Doma in : immunoglobulin homology <IM1> 

F; 135 -2 04 /Domain : immunoglobulin homology <IM2> 

F;241-310/Domain: immunoglobulin homology <IM3> 

F; 28-79/Disulf ide bonds: #status experimental 

F; 142 -202/Disulf ide bonds: #status experimental 

F; 178/Binding site: carbohydrate (Asn) (covalent) #status experimental 
F;248-308/Disulf ide bonds: #status experimental 

Query Match 71.8%; Score 889; DB 1; Length 329; 

Best Local Similarity 72,3%; Pred. No. 6e-62; 

Matches 162; Conservative 24; Mismatches 36; Indels 2; Gaps 1; 

Qy 6 TCPPCPAPELLGGPSVFLFPPKPKDTmiSRTPEVTCVVVDVSHEDPEVKFNWYVDGVEV 65 

III M II llllllhllllllllllll II lllllllll ::|lhl hll I 
Db 106 TCPKCPPPENLGGPSVFI FPPKPKDTLMI SLTPRVTCVAATDVSQDEPEVQFTWFVDNKPV 165 

Qy 66 HNAKTKPREEQYNSTYRWSVLTVLHQDWLNGKEYKCKVSNKALPAPI EKTI SKAKGQPR 125 

Ihllll lllhhil IN : Mill llhllll llllllllllllll II II 

Db 166 GNAETKPRVEQYNTTFRVES VLPI QHQDWLRGKEFKCKVYNKALPAPI EKTI SKTKGAPR 225 



Qy 126 EPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQP- -ENNYKTTPPVLDSDG 183 

I I I I I I I I I I I h h : I h I I h 1 = 1 = 11 III II I I II llh hll 
Db 226 M PDVYTLP PSRDELS KS KVS VTCL 1 1 NFF PAD I HVEWASNRVPVS EKE YKNTP P I EDADG 285 

Qy 184 SFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPG 227 

hlllllllllll I II h:|! IIIIIMI llh:| III 

Db 286 SYFLYSKLTVDKSAWDQGTVYTCSVMHEALHNHVTQKAISRSPG 329 



RESULT 15 
147158 

Ig gamma 1 chain constant region - pig (fragment) 
C; Species: Sus scrofa domestica (domestic pig) 

C;Date: 21-Feb-1997 #sequence_revision 21-Feb-1997 #text_change 21-Jan-2000 
C;Accession: 147158 

R;Kacskovics, I.; Sun, J.; Butler, J.E. 
J. Immunol. 153, 3565-3573, 1994 

A; Title: Five putative subclasses of swine IgG identified from the cDNA 
sequences of a single animal. 

A; Reference number: 147158; MUID : 95015845 ; PMID:7930579 
A; Access ion: 147158 

A;Status: preliminary; translated from GB/EMBL/DDBJ 
A; Molecule type: mRNA 
A;Residues: 1-328 <KAC> 

A; Cross -references : EMBL:U03778; NID:g433121; PIDN : AAA52216 . 1 ; PID:g433122 
C; Genetics : 
A; Gene: IgGl 

C;Superfamily : immunoglobulin C region; immunoglobulin homology 
F; 133 -202/Domain : immunoglobulin homology <IMM> 



Query Match 71.5%; Score 885.5; DB 2; Length 328; 

Best Local Similarity 72.4%; Pred. No. l.le-61; 

Matches 163; Conservative 27; Mismatches 32; Indels 3; Gaps 2; 



Qy 6 TCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCWVDVSHEDPEVKFNWYVDGVEV 65 

Ml II I = 'I- -I : I I : I | M M 

Db 105 TCPICPGCE-VAGPSVFIFPPKPKDTLMISQTPEVTCWVDVSKEHAEVQFSWYVDGVEV 163 

Qy 66 HNAKTKPREEQYNSTYRWSVLTVLHQDWLNGKEYKCKVSNKALPAPI EKTI SKAKGQPR 125 

I I : I : I : I I h I I I I I I I M I I I I I I I I I I I I I = I Mill :||||| II I 
Db 164 HTAETRPKEEQFNSTYRWS VLP I QHQDWLKGKEFKCKVNNVDLPAP I TRTI SKAI GQSR 223 

Qy 126 EPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQ - - PENNYKTTPPVLDSDG 183 

MINIM! MM-MMM l 1 1 1 1 1 1 IMMII III MMM I II 

Db 224 EPQVYTLPPPAEELSRSKVTLTCLVIGFYPPDIHVEVJKSNGQPEPENTYRTTPPQQDVDG 283 

Qy 184 SFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 228 

Ml MMM I | hlllllllllllllhl : || 
Db 284 TFFLYSKLAVDKARWDHGDKFECAVMHEALHNHYTQKSISKTQGK 328 
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Job time : 21 sees 
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(without alignments) 
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mus musculu 


13 
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.0 
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1 
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40 
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21. 


.6 
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1 


ALC1_G0RG0 
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41 
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.6 
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1 


ALC2_HUMAN 


P01877 


homo sapien 


42 
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20. 
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1 


MUCM ICTPU 


P23735 
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43 
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19. 


,8 


344 


1 


ALCJ40USE 
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44 
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.5 
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P97797 


m protein-t 


45 
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14 . 


,3 
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1 


LAC1_M0USE 


P01843 


mus musculu 



ALIGNMENTS 



RESULT 1 
GC1_HUMAN 

ID GC1_HUMAN STANDARD; PRT; 330 AA. 

AC P01857; 

DT 21-JUL-1986 (Rel . 01, Created) 

DT 21-JUL-1986 (Rel. 01, Last sequence update) 

DT 15-SEP-2003 (Rel. 42, Last annotation update) 

DE Ig gamma -1 chain C region. 

GN IGHG1 . 

OS Homo sapiens (Human) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

OX NCB I_TaxI D= 9 6 0 6 ; 

RN [1] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=82274238; PubMed=6287432 ; 

RA Ellison J.W., Berson B.J., Hood L.E.; 

RT "The nucleotide sequence of a human immunoglobulin C gammal gene."; 

RL Nucleic Acids Res. 10:4071-4079(1982). 

RN [2] 

RP SEQUENCE OF 1-135 (MYELOMA PROTEIN EU) . 

RX MEDLINE=71064024; PubMed=54 8 9771 ; 

RA Cunningham B.A., Rutishauser U. , Gall W.E., Gottlieb P.D., 

RA Waxdal M.J., Edelman G.M. ; 



RT "The covalent structure of a human gamma G- immunoglobulin. VII. Amino 

RT acid sequence of heavy-chain cyanogen bromide fragments H1-H4." ; 

RL Biochemistry 9:3161-3170(1970). 
RN [3] 

RP SEQUENCE OF 136-329 (EU) . 

RX MEDLINE=71064025; PubMed-5530842 ; 

RA Rutishauser U. , Cunningham B.A. , Bennett C. , Konigsberg W.H., 

RA Edelman G.M. ; 

RT "The covalent structure of a human gamma G- immunoglobulin. 8. Amino 

RT acid sequence of heavy-chain cyanogen bromide fragments H5-H7."; 

RL Biochemistry 9:3171-3181(1970). 

RN [4] 

RP SEQUENCE (MYELOMA PROTEIN NIE) . 

RX MEDLINE=77070269; PubMed=826475 ; 

RA Ponstingl H. , Hilschmann N. ; 

RT "The rule of antibody structure. The primary structure of a 

RT monoclonal IgGl immunoglobulin (myeloma protein Nie) . III. The 

RT chymotryptic peptides of the H-chain, alignment of the tryptic 

RT peptides and discussion of the complete structure."; 

RL Hoppe-Seyler's Z. Physiol. Chem. 357:1571-1604(1976). 

RN [5] 

RP SEQUENCE (MYELOMA PROTEIN KOL) , AND DISULFIDE BONDS . 

RX MEDLINE=83289131; PubMed=6884994 ; 

RA Schmidt W.E., Jung H.-D., Palm W. , Hilschmann N.; 

RT "Three-dimensional structure determination of antibodies. Primary 

RT structure of crystallized monoclonal immunoglobulin IgGl KOL, I . " ; 

RL Hoppe-Seyler's Z. Physiol. Chem. 364:713-747(1983). 

RN [6] 

RP DISULFIDE BONDS. 

RX MEDLINE=71064027; PubMed=4 923 144 ; 

RA Gall W.E., Edelman G.M.; 

RT "The covalent structure of a human gamma G- immunoglobulin. X. 

RT Intrachain disulfide bonds."; 

RL Biochemistry 9:3188-3196(1970). 

RN [7] 

RP DISULFIDE BONDS. 

RX MEDLINE=77070267; PubMed=l 002 12 9 ; 

RA Dreker L. , Schwarz J., Reichel W. , Hilschmann N. ; 

RT "Rule of antibody structure. The primary structure of a monoclonal 

RT IgGl immunoglobulin (myeloma protein Nie), I: Purification and 

RT characterization of the protein, the L- and H-chains, the 

RT cyanogen bromide cleavage products, and the disulfide bridges."; 

RL Hoppe-Seyler's Z. Physiol. Chem. 357:1515-1540(1976). 

RN [8] 

RP X-RAY CRYSTALLOGRAPHY (2.9 ANGSTROMS). 

RX MEDLINE=81208100; PubMed=7236608 ; 

RA Deisenhofer J. ; 

RT " Crystal lographic refinement and atomic models of a human Fc fragment 

RT and its complex with fragment B of protein A from Staphylococcus 

RT aureus at 2.9- and 2. 8 -A resolution ." ; 

RL Biochemistry 20:2361-2370(1981). 

CC -!- MISCELLANEOUS: NIE HAS THE G1M(17) ALLOTYPIC MARKER, 97-K, & THE 
CC G1M(1) MARKERS, 239-D & 241-L. KOL & EU SEQUENCES HAVE THE G1M(3) 

CC MARKER & THE G1M (NON-1) MARKERS. 

CC -!- MISCELLANEOUS: NIE ALSO DIFFERS IN THE AMIDATION STATES OF 
CC 35,116,198,269 & 272. 

CC -!- MISCELLANEOUS: EU ALSO DIFFERS IN THE AMIDATION STATES OF RESIDUES 



LL 




155, 166, 177, 195, 198, 269, AND 272 AND IN THE ORDER OF RESIDUES 


LL 




268-272. 




- ! - 


MISCELLANEOUS: KOL ALSO DIFFERS IN THE AMIDATION STATES OF 


pp 




RESIDUES 198,267&272. 


CC 

cc 








This SWISS-PROT entry is copyright. It is produced through a collaboration 


cc 


between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 


cc 


the 


European Bioinf ormatics Institute. There are no restrictions on its 


cc 


use 


by non-profit institutions as long as its content is in no way 


cc 


modified and this statement is not removed. Usage by and for commercial 


cc 


entities requires a license agreement (See http://www.isb-sib.ch/announce/ 


cc 
cc 

DR 


or send an email to license@isb-sib. ch) . 


EMBL ; J00228; AAC82527.1; ALT INIT. 


DR 


PIR 


■ A93433; GHHU. 


DR 


PDB 


• 1FC1 


■ 15 -JUL- 92. 


DR 


PDB 


• 1FC2 


• 15 -JUL- 92. 


DR 


PDB 


1AJ7 


12 -NOV- 97. 


DR 


PDB 


• 1D5B 


09-FEB-00. 


DR 


PDB, 


1D5I 


09-FEB-00. 


DR 


PDB, 


1D6V 


04-OCT-00. 


DR 


PDB, 


1DN2 


17-MAY-00. 


DR 


PDB, 


1E4K, 


06-JUN-01. 


DR 


PDB, 


1FCC, 


20- JUL- 95. 


DR 


PDB, 


1HZH, 


12-JUN-02. 


DR 


PDB, 


1I7Z, 


08-AUG-01. 


UK 


PDB, 


HIS, 


16-MAY-01. 


UK 


PDB, 


1IIX, 


16-MAY-01 . 


DR 


PDB, 


1L6X, 


10-APR-02. 


DR 


PDB, 


2RCS , 


12-NOV-97. 


DR 


Genew; HGNC:5525; IGHG1 . 


DR 


MIM; 147100; -. 


DR 


GO; 


GO: 0005624; C: membrane fraction; NAS. 


DR 


GO; 


GO:0003823; F: antigen binding activity; TAS. 


DR 


GO; 


GO: 0006955; P : immune response; NAS. 


JJK 


InterPro; 


IPR007110; Ig-like. 


DR 


InterPro; 


IPR003597; Ig cl. 


DR 


InterPro ; 


IPR003006; Ig MHC . 


DR 


Pfam; PF00047; ig; 3. 


DR 


SMART; SM00407; IGcl; 2. 


DR 


PROSITE; PS50835; IG LIKE; 3. 


DR 


PROSITE; PS00290; IG_MHC; 2. 


KW 


Immunoglobulin domain; Immunoglobulin C region; Glycoprotein; 


KW 


3D-structure. 


FT 


N0N_ 


TER 


1 1 


FT 


DOMAIN 


1 98 CHI . 


FT 


DOMAIN 


99 110 HINGE. 


FT 


DOMAIN 


Ill 223 CH2. 


FT 


DOMAIN 


224 330 CH3. 


FT 


DISULFID 


27 83 


FT 


DISULFID 


103 103 INTERCHAIN (WITH LIGHT CHAIN) . 


FT 


DISULFID 


109 109 INTERCHAIN (WITH HEAVY CHAIN) . 


FT 


DISULFID 


112 112 INTERCHAIN (WITH HEAVY CHAIN) . 


FT 


DISULFID 


144 204 


FT 


DISULFID 


250 308 


FT 


CARBOHYD 


180 180 N- LINKED (GLCNAC. . .). 


FT 


M0D_ 


RES 


33 0 33 0 REMOVED POST-TRANSLATIONALLY . 
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TURN 
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FT 


STRAND 
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TURN 
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Mill 71 7* TT~\ 
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FT 


TURN 
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FT 
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FT 
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FT 


STRAND 


280 
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FT 


TURN 
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r J. 


b IKANU 
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FT 


HELIX 


297 


301 


FT 


TURN 


302 
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FT 


STRAND 


305 


312 


FT 


TURN 


313 


314 


FT 


TURN 


316 


317 


FT 


STRAND 


320 


325 


SQ 


SEQUENCE 


330 AA; 


361 



K -> R (IN G1M(3) MARKER) . 

/FTId=VAR_003886. 

D -> E (IN GlM(NON-l) MARKER). 

/FTId=VAR_003887. 

L -> M (IN GlM(NON-l) MARKER). 

/FTId=VAR 003888. 



06 MW; 3770EE106C2FA33D CRC64 ; 

Query Match 99.6%; Score 1233; DB 1; Length 330; 

Best Local Similarity 100.0%; Pred. No. 2.2e-93; 

Matches 227; Conservative 0; Mismatches 0; Indels 0; 



Gaps 



0; 



Qy 



Db 



2 DKTHTCPPCPAPELLGGPSVFLFPPKPKJ3TLMISRTPEVTCVVVDVSHEDPEVKFNWYVD 61 

IIIIMMIIIIIMIIIIIIIIMIIIIIMIIIIIIIMIIIIIIIIIIIMIIIIII 

104 DKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVWDVSHEDPEVKFNWYVD 163 



Qy 

Db 

Qy 
Db 



62 GVEVHNAKTKPREEQYNSTYRWSVLTVLHQDWLNGKEYKCKVSNKALPAPI EKTI SKAK 121 

MIIMIIMIIMIIIIIIIIIIMMIIIIIIIIIIMIIIIIIIIMIIIIIIIIII 

164 GVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPI EKTI SKAK 223 
122 GQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDS 181 

IMIMIIIIMIIIIIMIIIIIIIIIIIIIMIMMIIIIIIIIIIIIIIIMIIII 

224 GQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDS 283 



Qy 



182 DGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 228 



I llllllllll IIIIIIIIMIIIIII! illllllMMIM 

Db 284 DGSFFLYSKLTVT3KSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 33 0 

RESULT 2 
GC2_HUMAN 

ID GC2_HUMAN STANDARD; PRT; 326 AA. 

AC P01859; 

DT 21-JUL-1986 (Rel. 01, Created) 

DT 21-JUL-1986 (Rel. 01, Last sequence update) 

DT 16-OCT-2001 (Rel. 40, Last annotation update) 

DE Ig gamma -2 chain C region. 

GN IGHG2 . 

OS Homo sapiens (Human) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

OX NCB I_Tax I D= 9 6 0 6 ; 
RN [1] 

RP SEQUENCE OF 2-326 FROM N.A. 

RX MEDLINE=82197621; PubMed=68 04 948 ; 

RA Ellison J.W., Hood L.E.; 

RT "Linkage and sequence homology of two human immunoglobulin gamma 

RT heavy chain constant region genes."; 

RL Proc. Natl. Acad. Sci. U.S.A. 79:1984-1988(1982). 

RN [2] 

RP SEQUENCE OF 88-115 FROM N.A. 

RC TISSUE=Fetal liver; 

RX MEDLINE=83001943; PubMed=68 11 13 9 ; 

RA Takahashi N., Ueda S., Obata M. , Nikaido T. , Nakai S., Honjo T . ; 

RT "Structure of human immunoglobulin gamma genes: implications for 

RT evolution of a gene family. "; 

RL Cell 29:671-679(1982). 
RN [3] 

RP SEQUENCE OF 99-177 AND 310-326 FROM N.A. 

RC TISSUE=Fetal liver; 

RX MEDLINE=84235992; PubMed-632 9676 ; 

RA Krawinkel U. , Rabbitts T.H.; 

RT "Comparison of the hinge-coding segments in human immunoglobulin gamma 

RT heavy chain genes and the linkage of the gamma 2 and gamma 4 subclass 

RT genes . " ; 

RL EMBO J. 1:403-407(1982). 

RN [4] 

RP SEQUENCE OF 1-325 (MYELOMA PROTEIN TIL) . 

RX MEDLINE=81007873; PubMed=6774 012 ; 

RA Wang A.-C, Tung E . , Fudenberg H.H.; 

RT "The primary structure of a human IgG2 heavy chain: genetic, 

RT evolutionary, and functional implications."; 

RL J . Immunol. 125:1048-1054(198 0). 

RN [5] 

RP SEQUENCE OF 1-85 AND 132-325 (MYELOMA PROTEIN ZIE) . 

RX MEDLINE=80001357; PubMed=113 060 ; 

RA Connell G.E., Parr D.M., Hofmann T. ; 

RT "The amino acid sequences of the three heavy chain constant region 

RT domains of a human IgG2 myeloma protein."; 

RL Can. J . Biochem. 57:758-767(1979). 

RN [6] 

RP SEQUENCE OF 238-275 (ZIE) . 



RX 


MEDLINE=8 01144 19; PubMed=11892 0 ; 




RA 


Hofmann T., Parr D.M.; 




RT 


"A note of the amino acid sequence of residues 


381-391 of human 


RT 


immunoglobulins gamma chains."; 




RL 


Mol . Immunol. 16:923-925(1979). 




KIN 


I I \ 




Kir 


Kh,vlbIUNb IU zb ; 59; 60 AND 264-268 (ZIE) . 




KA 


Hofmann T. , Parr D.M. ; 




RL 


Submitted (MAR-1980) to the PIR data bank. 




RN 


r n T 

[8] 




RP 


SEQUENCE OF 1-121 (DOT) . 




KA 


MhDLINE=952552 98 ; PubMed=7 737190 ; 




KA 


Stoppini M. , Bellotti V. , Negri A. , Merlini G. , 


Garver F., Ferri G. ; 


Ki 


"Characterization of the two unique human anti- 


flavin monoclonal 


RT 


immunoglobulins . " ; 




RL 


Eur. J. Biochem. 228:886-893(1995). 




RN 


[9] 




KP 


DISULFIDE BONDS. 




RX 


MEDLINE=72033500 ; PubMed=4 940472 ; 




"D 7\ 

KA 


Milstein C, Frangione B.; 




RT 


"Disulphide bridges of the heavy chain of human 


immunoglobulin G2 . " ; 


RL 


Biochem. J. 121:217-225(1971). 




DM 
X\iN 


[10] 




Kir 


DISULFIDE BONDS. 




DV 
KA 


MEDLINE=69064124; PubMed=5782707 ; 






Frangione B., Milstein C, Pink J.R.L.; 




RT 


"Structural studies of immunoglobulin G."; 




RT, 

cc 


Nature 221:145-148 (1969) . 




cc 


This SWISS-PROT entry is copyright. It is produced through a collaboration 


cc 


between the Swiss Institute of Bioinf ormat ics 


and the EMBL outstation - 


cc 


the European Bioinf ormatics Institute. There are no restrictions on its 


cc 


use by non-profit institutions as long as its content is in no way 


cc 


modified and this statement is not removed. Usage by and for commercial 


cc 


entities requires a license agreement (See http 


: //www. isb-sib. ch/announce/ 




or send an email to license@isb-sib. ch) . 




UL- 






UK 


EMBL; J0023 0; AAB59393.1; -. 




DR 


PIR; A93906; G2HU. 




DR 


HSSP; P01857; 1FC1 . 




DR 


Genew; HGNC:5526; IGHG2 . 




DR 


MIM; 147110; 




DR 


GO; GO: 0005624; C: membrane fraction; NAS. 




DR 


GO; GO: 0003823; F: antigen binding activity; TAS 




DR 


GO; GO: 0006955; P : immune response; NAS. 




JJK 


InterPro; IPR007110; Ig-like. 




DR 


InterPro; IPR003597; Ig_cl . 




DR 


InterPro; IPR003006; IgJVlHC. 




DR 


Pfam; PF00047; ig; 3. 




DR 


SMART; SM00407; IGcl; 2. 




DR 


PROSITE; PS50835; IG LIKE; 3. 




DR 


PROSITE; PS00290; IG_MHC; 2. 




KW 


Immunoglobulin domain; Immunoglobulin C region. 




FT 


NON_TER 1 1 




FT 


DOMAIN 1 98 CHI. 




FT 


DOMAIN 99 110 HINGE. 




FT 


DOMAIN 111 219 CH2. 





r 1 


1JUMAIN 


220 


326 


CH3 . 




r 1 


Ul oULir 1 U 


1 A 

±4 


T A 
14 


INTERCHAIN (WITH A LIGHT CHAIN) . 




FT 




j£ / 


O -D 






FT 


FIT CUTT.CTn 


i no 


1 AO 


INTERCHAIN (WITH A HEAVY CHAIN) . 




F 1 


nj CT TT .T7T n 
JJlDULir 1 JJ 


i m 

1U j 


1 fio 

1UJ 


INTERCHAIN (WITH A HEAVY CHAIN) . 




r 1 


HT CT TT DTH 

UloU-Lr I JJ 


±Uo 


106 


INTERCHAIN (WITH A HEAVY CHAIN) . 




T7T 1 
r 1 






109 


INTERCHAIN (WITH A HEAVY CHAIN) . 




FT 


DISULFID 


140 


2 00 






FT 


DISULFID 


246 


304 






FT 


SITE 


156 


156 


AT OR NEAR THE COMPLEMENT- BINDING 


SITE. 


FT 


MOD_RES 


326 


326 


REMOVED POST - TRANSLAT I ONALL Y (PROBABLE) 


FT 


VARIANT 


60 


60 


S -> A (IN MYELOMA PROTEINS TIL & 


ZIE) . 


FT 








/FTId=VAR__003889 . 




FT 


CONFLICT 


109 


109 


C -> S (IN REF. 3) . 




SQ 


SEQUENCE 


326 AA; 


35884 


MW; 8310878C6878CF9C CRC64 ; 





Query Match 92.3%; Score 1142.5; DB 1; Length 326; 

Best Local Similarity 94.1%; Pred. No. 5e-86; 

Matches 209; Conservative 8; Mismatches 4; Indels 1; Gaps 1; 

CPPCPAPELLGGPSVFLFPPKPKDTLMI SRT P E VTC VWD VS HED P E VKFNW Y VDG VE VH 66 

I M 1 1 M : 1 1 M M I INI I MM II III II I II I II III llllhl llllll II 1 1 

CPPCPAPP-VAGPSVFLFPPKPKDTLMISRTPEVTCVWDVSHEDPEVQFNWYVDGVEVH 164 
NAKTKPREEQYNSTYRWSVLTVLHQDWLNGKEYKCKVSNKALPAPI EKTI SKAKGQPRE 126 

MMMMIhllhlMMMhlMMMMMMMM II I Mill Ml 1 1 Ml I 

NAKTKPREEQFNSTFRWS VLTWHQDWLNGKE YKCKVSNKGLPAP I EKTI SKTKGQPRE 224 
PQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFF 186 

IMIIIIIIhhIIIIIIIIIMIIMIIIIMIIIIIIIIIIIMIIIhlMIIIM 

PQVYTLPPSREEMTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPMLDSDGSFF 284 
L YS KLTVDKSRWQQGNVFS CS VMHEALHNH YTQKSLSLS PGK 228 

llllllllllllllllllllllllllllllllllllllllll 

LYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 326 



Qy 


7 


Db 


106 


Qy 


67 


Db 


165 


Qy 


127 


Db 


225 


Qy 


187 


Db 


285 



RESULT 3 
GC4_HUMAN 

ID GC4_HUMAN STANDARD; PRT; 327 AA. 

AC P01861; 

DT 21-JUL-1986 (Rel . 01, Created) 

DT 21-JUL-1986 (Rel. 01, Last sequence update) 

DT 15-SEP-2003 (Rel. 42, Last annotation update) 

DE Ig gamma-4 chain C region. 

GN IGHG4 . 

OS Homo sapiens (Human) . 

OC Eukaryota; Metazoa; Chordata; Crania ta ; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

OX NCB I_TaxI D= 9 6 0 6 ; 

RN [1] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=83157104; PubMed=62 99662 ; 

RA Ellison J.W. , Buxbaum J.N. , Hood L.E.; 

RT "Nucleotide sequence of a human immunoglobulin C gamma 4 gene."; 

RL DNA 1:11-18 (1981) . 

RN [2] 



RP 
RX 
RA 
RT 
RT 
RL 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
KW 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
SQ 



SEQUENCE OF 1-30 AND 81-326. 
MEDLINE=70207560; PubMed=4 192699 ; 

Pink J.R.L., Buttery S.H., de Vries G.M. , Milstein C. ; 

"Human immunoglobulin subclasses. Partial amino acid sequence of the 

constant region of a gamma 4 chain."; 

Biochem. J. 117:33-47(1970). 

This SWISS-PROT entry is copyright. It is produced through a collaboration 
between the Swiss Institute of Bioinformatics and the EMBL outstation - 
the European Bioinformatics Institute. There are no restrictions on its 
use by non-profit institutions as long as its content is in no way 
modified and this statement is not removed. Usage by and for commercial 
entities requires a license agreement (See http://www.isb-sib.ch/announce/ 
or send an email to license@isb-sib. ch) . 

EMBL; K01316; AAB59394.1; ALT_INIT. 
PIR; A90933; G4HU. 
PDB; 1ADQ; 16-SEP-98 . 
Genew; HGNC:5528; IGHG4 . 
MIM; 147130; -. 

GO; GO: 0005624; C:membrane fraction; NAS. 

GO; GO: 0003823; F: antigen binding activity; TAS . 

GO; GO: 0006955; P : immune response; NAS. 

InterPro; IPR007110; Ig-like. 

InterPro; IPR003597; Ig__cl . 

InterPro; IPR003 006; IgJVIHC. 

Pfam; PF00047; ig; 3. 

SMART; SM004 07; IGcl ; 2. 

PROSITE; PS50835; IG_LIKE; 3. 

PROSITE; PS00290; IG_MHC; 2. 

Immunoglobulin domain; Immunoglobulin C region; 3D-structure . 



N0N_TER 
DOMAIN 
DOMAIN 
DOMAIN 
DOMAIN 
DISULFID 
DISULFID 
DISULFID 
DISULFID 
DISULFID 
DISULFID 
SEQUENCE 



1 
1 
99 
111 
221 
14 
27 
106 
109 
141 
247 
327 AA; 



1 

98 
110 
220 
327 

14 

83 
106 
109 
201 
305 

35940 



CHI. 
HINGE. 
CH2. 
CH3. 

INTERCHAIN 

INTERCHAIN 
INTERCHAIN 



(WITH A LIGHT CHAIN) . 



(WITH A 
(WITH A 



HEAVY CHAIN) 
HEAVY CHAIN) 



MW; 3EDBD811EF2 08E7A CRC64 ; 



Query Match 91.7%; Score 1135; DB 1; Length 327; 

Best Local Similarity 93.7%; Pred. No. 2.1e-85; 

Matches 2 08; Conservative 8; Mismatches 6; Indels 0; 



Gaps 



0; 



QY 



Db 



Qy 



Db 



7 CPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFNWYVDGVEVH 66 

II Mill ■ I 1 1 1 1 1 1 1 I 1 1 1 , 1 . i I , [ 1 1 , 1 . 1 , 1 llllhlllllllllll 

106 CPSCPAPEFLGGPSWLFPPKPKDTLMISRTPEVTCWVDVSQE 165 
67 NAKTKPREEQYNSTYRWSVLTVLHQDWLNGKEYKCKVSNKALPAPI EKTI SKAKGQPRE 126 

MIM Ml IhlMII III MINIM MUM MM MM lh MMMIMIMM 

166 NAKTKPREEQFNSTYRWSVLTVLHQDWLNGKEYKCKVSNKGLPSS I EKTI SKAKGQPRE 225 



QY 127 PQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFF 186 



Mlllllll^hllllllllllMIIIIIIIIIIIIMIIIIIIIIMIIIIIIIIIII 

Dt) 226 PQVYTLPPSQEEMTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFF 28 5 

Qy 187 LYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 228 

I I h i M I I I I I h I I I I I I I I I I I I I I I II I I I I I I | || 
Db 286 LYSRLTVDKSRWQEGNVFSCSVMHEALHNHYTQKSLSLSLGK 327 



RESULT 4 
GC3_HUMAN 

ID GC3_HUMAN STANDARD; PRT; 290 AA. 

AC P01860; 

DT 21-JUL-1986 (Rel. 01, Created) 

DT 21-JUL-1986 (Rel. 01, Last sequence update) 

DT 28-FEB-2003 (Rel. 41, Last annotation update) 

DE Ig gamma-3 chain C region {Heavy chain disease protein) (HDC) 

GN IGHG3 . 

OS Homo sapiens (Human) . 

OC Eukaryota; Metazoa; Chordata,- Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

OX NCBI_TaxID=9606; 

RN [1] 

RP SEQUENCE (DISEASE PROTEIN WIS) . 

RX MEDLINE=81021548; PubMed=6774747 ; 

RA Frangione B. , Rosenwasser E., Prelli F., Franklin E.C.; 

RT "Primary structure of human gamma 3 immunoglobulin deletion mutant: 



RT gamma 3 heavy-chain disease protein Wis."; 

RL Biochemistry 19:4304-4308(1980). 

RN [2] 

RP REVISIONS TO 12-97 (PROTEIN WIS) . 

RX MEDLINE=77118561; PubMed=4 02363 ; 

RA Michaelsen T.E., Frangione B., Franklin E.C.; 

RT "Primary structure of the 'hinge' region of human IgG3 . Probable 

RT quadruplication of a 15-amino acid residue basic unit."; 

RL J. Biol. Chem. 252:883-889(1977). 

RN [3] 

RP REVISIONS TO 59-289 (PROTEIN WIS/DISEASE PROTEIN ZUC) . 

RX MEDLINE-77021516; PubMed=823945 ; 

RA Wolfenstein-Todel C. , Frangione B. , Prelli F., Franklin E.C.; 

RT "The amino acid sequence of 'heavy chain disease 1 protein ZUC. 

RT Structure of the Fc fragment of immunoglobulin G3 . " ; 

RL Biochem. Biophys. Res. Commun. 71:907-914(1976). 

RN [4] 

RP SEQUENCE FROM N.A. (DISEASE PROTEIN OMM) . 

RX MEDLINE=82247835; PubMed=6808505 ; 

RA Alexander A., Steinmetz M., Barritault D., Frangione B., 

RA Franklin E.C., Hood L. , Buxbaum J.N. ; 

RT "Gamma Heavy chain disease in man: cDNA sequence supports partial 

RT gene deletion model ." ; 

RL Proc. Natl. Acad. Sci. U.S.A. 79:3260-3264(1982). 

CC -!- SUBUNIT: DIMER LINKED BY 12 DISULFIDE BONDS; IT HAS AN EXTRA 
CC INTERCHAIN DISULFIDE BOND AT POSITION 7 IN ADDITION TO THE 11 

CC NORMALLY PRESENT IN THE HINGE REGION. 

CC -!- MISCELLANEOUS: THE HEAVY CHAIN DISEASE PROTEIN WIS IS SHOWN. 

CC -!- MISCELLANEOUS: THE SEQUENCE OF RESIDUES 42-76 WAS TAKEN FROM THE 

CC REF . 2 . 

CC -!- MISCELLANEOUS: DISEASE PROTEIN WIS IS LACKING MOST OF THE V REGION 



cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 

DR 

DR 

DR 

DR 

DR 

DR 

DR 

DR 

DR 

DR 

DR 

DR 

DR 

DR 

KW 

KW 

FT 

FT 

FT 

FT 

FT 

FT 

FT 

FT 

FT 

FT 

FT 

FT 

FT 

FT 

FT 

FT 

FT 

FT 

FT 

FT 

FT 

FT 



AND ALL OF THE CHI REGION. 
-!- MISCELLANEOUS: DISEASE PROTEIN ZUC LACK MOST OF THE V REGION, ALL 

OF THE CHI REGION, AND PART OF THE HINGE COMPARED WITH NORMAL 

GAMMA -3 HEAVY CHAINS. 
-!- MISCELLANEOUS: DISEASE PROTEIN OMM MAY REPRESENT AN ALLELIC FORM 

OR ANOTHER GAMMA CHAIN SUBCLASS. 
-!- MISCELLANEOUS: THE HINGE REGION IN GAMMA -3 CHAINS IS ABOUT FOUR 

TIMES AS LONG AS IN OTHER GAMMA CHAINS AND CONTAINS THREE 

IDENTICAL 15-RESIDUE SEGMENTS PRECEDED BY A SIMILAR 17-RESIDUE 

SEGMENT (12-28) . 

This SWISS-PROT entry is copyright. It is produced through a collaboration 
between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 
the European Bioinf ormatics Institute. There are no restrictions on its 
use by non-profit institutions as long as its content is in no way 
modified and this statement is not removed. Usage by and for commercial 
entities requires a license agreement (See http://www.isb-sib.ch/announce/ 
or send an email to license@isb-sib. ch) . 

EMBL; J00231; AAA52805.1; ALT_SEQ. 
HSSP; P01857; 1FC1 . 
Genew; HGNC:5527; IGHG3 . 
MIM; 147120; -. 

GO; GO: 0005624; C: membrane fraction; MAS . 
GO; GO: 0003823; F: antigen binding activity; TAS. 
GO; GO: 0006955; P : immune response; NAS . 
InterPro; IPR007110; Ig-like. 
InterPro; IPR003597; Ig_cl. 
InterPro; IPR003006; Ig_MHC. 
Pfam; PF00047; ig ; 2. 
SMART; SM00407; IGcl; 1. 
PROSITE; PS50835; IG_LIKE; 2. 
PROSITE; PS00290; IG_MHC; 1. 

Immunoglobulin domain; Immunoglobulin C region; Glycoprot 
Pyrrolidone carboxylic acid. 



ein; Repeat; 



DOMAIN 
DOMAIN 
DOMAIN 
REPEAT 
REPEAT 
REPEAT 
MOD_RES 
CARBOHYD 
DISULFID 
DISULFID 
DISULFID 
DISULFID 
DISULFID 
DISULFID 
DISULFID 
DISULFID 
DISULFID 
DISULFID 
DISULFID 
DISULFID 
CARBOHYD 
MOD RES 



12 
74 
184 
29 
44 
59 



24 
27 
33 
39 
42 
48 
54 
57 
63 
69 
72 
140 
290 



73 
183 
289 
43 
58 
73 



24 
27 
33 
39 
42 
48 
54 
57 
63 
69 
72 
140 
290 



HINGE . 
CH2 . 
CH3. 



PYRROLIDONE CARBOXYLIC ACID. 
N- LINKED (GLCNAC. . .). 
INTERCHAIN (WITH HEAVY CHAIN DIMER) . 

(WITH HEAVY CHAIN DIMER) . 
(WITH HEAVY CHAIN DIMER) . 
(WITH HEAVY CHAIN DIMER) . 
(WITH HEAVY CHAIN DIMER) . 
(WITH HEAVY CHAIN DIMER) . 
(WITH HEAVY CHAIN DIMER) . 
(WITH HEAVY CHAIN DIMER) . 
(WITH HEAVY CHAIN DIMER) . 
(WITH HEAVY CHAIN DIMER) . 
(WITH HEAVY CHAIN DIMER) . 
(WITH HEAVY CHAIN DIMER) . 
N-LINKED (GLCNAC. . .). 
REMOVED POST-TRANSLATIONALLY. 



INTERCHAIN 
INTERCHAIN 
INTERCHAIN 
INTERCHAIN 
INTERCHAIN 
INTERCHAIN 
INTERCHAIN 
INTERCHAIN 
INTERCHAIN 
INTERCHAIN 
INTERCHAIN 



r 1 


VAK1AN 1 


126 


127 


QV -> EB (IN ZUC) . 


r 1 








/FTIa=VAR 003890. 


r 1 


UADT A VTT 1 

VAKIAJM 1 


134 


134 


P -> L (IN OMM) . 


Dm 

r 1 








/FTId=VAR 003891. 


r 1 


T T7\ OTA TiTT* 

VAK1AJM 1 


139 


139 


F -> Y (IN OMM) . 


r 1 








/FTId=VAR_0038 92 . 


r 1 


T 77V T3 T A MT 

VAK1AJNJ 1 


TOO 

loz 


182 


T -> A (IN OMM) . 


r 1 








/FTIa=VAR 003893 . 


FT 


VARIANT 


227 


227 


S -> N (IN OMM) . 


FT 








/FTId=VAR_003894 . 


FT 


VARIANT 


227 


227 


MISSING (IN ZUC) . 


FT 








/FTId=VAR_003895. 


FT 


VARIANT 


279 


279 


F -> Y (IN OMM) . 


FT 








/FTId=VAR_003896. 


SQ 


SEQUENCE 


290 AA; 


32331 


MW; E69CBC95705B2F46 



Query Match 91.0%; Score 1126/ DB 1; Length 290; 

Best Local Similarity 90.3%; Pred. No. 9.6e-85; 

Matches 2 05; Conservative 13; Mismatches 9; Indels 0; Gaps 0; 



Qy 


2 


DKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVvT)VSHEDPEVKFNWYVD 

1 M IIMIIIIMIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIMIhl MM 

DTPPPCPRCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVQFKWYVD 


61 


Db 


64 


123 


Qy 


62 


GVEVHNAKTKPREEQYNSTYRWSVLTVLHQDWLNGKEYKCKVSNKALPAPI EKTI SKAK 

1 1 M 1 M II M 1 h h 1 1 h II 1 M 1 1 1 1 M : 1 h 1 1 II II 1 1 1 1 1 M II M II 1 II 1 1 

GVQVHNAKTKPREQQFNSTFRWSVLTVLHQNWLDGKEYKCKVSNKALPAPI EKTI SKTK 


121 


Db 


124 


183 


Qy 


122- 


GQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDS 
: ' i 1 . , 1 1 ' : , . = , ! ,,|| ' . , 1 | | | | : | | | 
GQPREPQVYTLPPSREEMTKNQVSLTCLVKGFYPSDIAVEWESSGQPENNYNTTPPMLDS 


181 


Db 


184 


243 


Qy 


182 


DGS FFL YS KLTVDKSRWQQGNVFS CS VMHEALHNH YTQKSLSLS PGK 228 

1 1 M II 1 M 1 1 1 M II 1 1 M h II 1 II 1 II 1 1 M MMIMIMM 

DGS FFL YS KLTVDKSRWQQGN I FS CS VMHEALHNRFTQKSLSLS PGK 290 




Db 


244 





RESULT 5 
GC_RABIT 

ID GC_RABIT STANDARD; PRT; 323 AA. 

AC P01870; 

DT 21-JUL-1986 (Rel . 01, Created) 

DT 21-JUL-1986 (Rel. 01, Last sequence update) 

DT 15-SEP-2003 (Rel. 42, Last annotation update) 

DE Ig gamma chain C region. 

OS Oryctolagus cuniculus (Rabbit) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Lagomorpha; Leporidae; Oryctolagus. 

OX NCB I_TaxI D= 9986; 

RN [1] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=84030930; PubMed=6313520 ; 

RA Bernstein K.E., Alexander C.B., Mage R.G.; 

RT "Nucleotide sequence of a rabbit IgG heavy chain from the recombinant 

RT F-I haplotype.",- 

RL Immunogenetics 18:387-397(1983). 

RN [2] 



RP SEQUENCE OF 1-128. 

RX MEDLINE=76135469; PubMed=1243651 ; 
RA Pratt D.M. , Mole L.E.; 

RT "Sequence studies on the constant region of the Fd sections of rabbit 
RT immunoglobulin G of different allotype."; 
RL Biochem. J . 151:337-349(1975). 
RN [3] 

RP SEQUENCE OF 88-266 FROM N.A. 

RX MEDLINE=83299917; PubMed=6193512 ; 

RA Martens C.L., Moore K.W., Steinmetz M., Hood L., Knight K.L.; 

RT "Heavy chain genes of rabbit IgG: isolation of a cDNA encoding gamma 

RT heavy chain and identification of two genomic C gamma genes . " ; 

RL Proc. Natl. Acad. Sci. U.S.A. 79:6018-6022(1982). 

RN [4] 

RP SEQUENCE OF 132-161. 

RX MEDLINE=70110015; PubMed=54611 06 ; 

RA Fruchter R.G. , Jackson S.A., Mole L.E., Porter R.R.; 

RT "Sequence studies of the Fd section of the heavy chain of rabbit 

RT immunoglobulin G."; 

RL Biochem. J. 116:249-259(1970). 

RN [5] 

RP SEQUENCE OF 129-131 AND 155-322. 

RA Hill R.L., Lebovitz H.E., Fellows R.E. Jr., Delaney R. ; 
RL (In) Kil lander J. (eds.); 

RL Gamma globulins, Nobel symp. 3, pp. 109-127, Almqvist and Wiksell, 

RL Stockholm (1967) . 

CC -!- MISCELLANEOUS: REF . 1 SEQUENCE HAS THE D12 ALLOTYPIC MARKER, 

CC 104-THR, AND THE E14 MARKER, 185-THR. REF . 3 HAS THE Dll AND E15 

CC MARKERS AND REF. 5 THE El 5 MARKER. 

CC -!- SIMILARITY: Contains 3 immunoglobulin- like domains. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinformatics and the EMBL outstation - 

CC the European Bioinformatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib. ch) . 

CC 

DR EMBL; M16426; AAA31289.1; -. 

DR PIR; A9174 9; GHRB . 

DR HSSP; P01857; 1FC1 . 

DR InterPro; IPR007110; Ig-like. 

DR InterPro; IPR003597; Ig_cl. 

DR InterPro; IPR003006; IgJVIHC. 

DR Pfam; PF00047; ig ; 2. 

DR SMART; SM004 07; IGcl; 2. 

DR PROSITE; PS50835; IG_LIKE; 3. 

DR PROSITE; PS00290; IGJ4HC; 1. 

KW Immunoglobulin domain; Immunoglobulin C region; Repeat. 

FT N0N_TER 1 1 

FT DOMAIN 6 96 IG-LIKE 1. 

FT DOMAIN 114 213 IG-LIKE 2. 

FT DOMAIN 222 318 IG-LIKE 3. 

FT VARIANT 104 104 T -> M (IN Dll MARKER) . 

FT VARIANT 185 185 T -> A (IN E15 MARKER) . 

FT CONFLICT 48 48 N -> E (IN REF . 2). 



r I 


LUIMrLl LI 


71 


71 


V 


-> 


VPV (IN REF. 2) . 


rr"-p 


LUWr J_il LI 


144 


144 


Q 


-> 


E 


(IN REF. 


3 AND 4) . 


r i 


PHMUT T <~"T 
rLlLl 


1 /J 


173 


N 


-> 


D 


(IN REF. 


5) . 


r 1 


LUIMr LilLl 


TOT 

lo / 


187 


Q 


-> 


E 


(IN REF. 


3 AND 5) . 


r 1 


LUJNr J_i± L 1 




201 


N 


-> 


D 


(IN REF. 


5) . 


C 1 


^*J1N r ±j 1 L 1 


z 1 o 




Q 


-> 


E 


(IN REF . 


5) . 


FT 


CONFLICT 


233 


233 


E 


-> 


Q 


(IN REF. 


5) . 


FT 


CONFLICT 


246 


246 


N 


-> 


D 


(IN REF. 


5) . 


FT 


CONFLICT 


256 


256 


E 


-> 


G 


(IN REF. 


5) . 


FT 


CONFLICT 


260 


260 


N 


-> 


D 


(IN REF. 


5) . 


FT 


CONFLICT 


266 


266 


N 


-> 


D 


(IN REF. 


5) . 


FT 


CONFLICT 


280 


280 


Y 


- > 


W 


(IN REF. 


5) . 


FT 


CONFLICT 


284 


284 


N 


-> 


S 


(IN REF . 


5) . 


SQ 


SEQUENCE 


323 AA; 


35404 MW; 




69E8AA118D579A8B CRC64 ; 


Query Match 




74 .2%; 


Score 918.5; DB 


■ 1 ; Leng 



Qy 
Db 

Qy 
Db 

Qy 

Db 

Qy 
Db 



Best Local Similarity 71.7%; Pred. No. 8.9e-68; 
Matches 167; Conservative 29; Mismatches 32; Indels 5; Gaps 2; 

1 MDKT---HTC--PPCPAPELLGGPSVFLFPPKPKDTLMISRTPEvTC^VVlDVSHEDPEVK 55 

I M I I I I I I I I : II I I I I I I I I I I I M I I I I I I I I I I : I I I I : 

91 VDKTVAPSTCSKPTCPPPELLGGPSVFI FPPKPKDTLMI SRTPEVTCVWDVSQDDPEVQ 150 

56 FNWYVTDGVEVTiNAKTKPREEQYNSTYRWS VXiT VLHQDWLNGKE YKCKVSNKALPAP I EK 115 
I II:: : I h 11=1=111 I I I I I : I.I Mhllll MINIUM 
151 FTWYINNEQVRTARPPLREQQFNSTIRWSTLPITHQDWLRGKEFKCKVHNKALPAPIEK 210 

116 TISKAKGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTT 175 

'Ml !,' . . ,| I |:||: HIM:: MINIMUM ||: I 

211 TI SKARGQPLEPKVYTMGPPREELSSRSVSLTCMINGFYPSDI SVEWEKNGKAEDNYKTT 270 

176 PPVLDSDGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 228 

I llllllhllhlhl I Ihhihillllllllllllllhl I I I I 

271 PAVLDSDGS YFLYNKLSVPTSEWQRGDVFTCSVMHEALHNHYTQKS I SRSPGK 323 



RESULT 6 
GC2_CAVPO 

ID GC2_CAVPO STANDARD; PRT; 32 9 AA. 

AC P01862; 

DT 21-JUL-1986 (Rel . 01, Created) 

DT 21-JUL-1986 (Rel. 01, Last sequence update) 

DT 15-JUL-1999 (Rel. 38, Last annotation update) 

DE Ig gamma-2 chain C region. 

OS Cavia porcellus (Guinea pig) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Rodentia; Hystricognathi ; Caviidae; Cavia. 

OX NCB I _Tax ID=10141; 

RN [1] 

RP SEQUENCE OF 1-3. 

RA Trischmann T.M.; 

RL Submitted (APR-1975) to the PIR data bank. 

RN [2] 

RP SEQUENCE OF 4-68. 

RX MEDLINE=71058471; PubMed=5538606 ; 

RA Birshtein B.K., Hussain Q.Z., Cebra J.J.; 

RT "Structure of heavy chain from strain 13 guinea pig 



RT immunoglobulin-G(2) . 3. Amino acid sequence of the region around the 

RT half-cystine joining heavy and light chains."; 
RL Biochemistry 10:18-25(1971). 
RN [3] 

RP SEQUENCE OF 69-133 AND 312-329. 

RX MEDLINE=71058486; PubMed=5538616 ; 

RA Turner K.J., Cebra J.J.; 

RT "Structure of heavy chain from strain 13 guinea pig 

RT immunoglobulin-G(2) . II. Amino acid sequence of the carboxyl -terminal 

RT and hinge region cyanogen bromide fragments."; 

RL Biochemistry 10:9-17(1971). 
RN [4] 

RP SEQUENCE OF 134-226. 

RX MEDLINE=75036072; PubMed=4429665 ; 

RA Tracey D.E., Cebra J.J.; 

RT "Primary structure of the CH2 homology region from guinea pig IgG2 

RT antibodies. "; 

RL Biochemistry 13:4796-48 03(1974). 
RN [5] 

RP SEQUENCE OF 227-311. 

RX MEDLINE=75036073; PubMed=4609467 ; 

RA Trischmann T.M., Cebra J.J.; 

RT "Primary structure of the CH3 homology region from guinea pig IgG2 

RT antibodies."; 

RL Biochemistry 13:4804-4811(1974). 
RN [6] 

RP DISULFIDE BONDS. 

RX MEDLINE=71058474; PubMed=4 922544 ; 

RA Oliveira B., Lamm M.E.; 

RT "Interchain disulfide bridges of guinea pig gamma-2- immunoglobulin. " ; 

RL Biochemistry 10:26-31(1971). 

CC -!- MISCELLANEOUS: THIS CHAIN WAS ISOLATED FROM POOLED SERUM OF STRAIN 

CC 13 INBRED GUINEA PIGS. 

DR PIR; A94553; G2GP. 

DR HSSP; P01842; 7 FAB . 

DR InterPro; IPR007110; Ig-like. 

DR InterPro; IPR003597; Ig_cl. 

DR InterPro; IPR003006; Ig_MHC. 

DR Pfam; PF00047; ig ; 2. 

DR SMART; SM00407; IGcl; 2. 

DR PROSITE; PS50835; IG_LIKE; 3. 

DR PROS I TE ; PS 0 0 2 9 0 ; I G_MHC ; 1 . 

KW Immunoglobulin domain; Immunoglobulin C region; Glycoprotein. 

FT NON_TER 1 1 

FT DISULFID 16 16 INTERCHAIN (WITH A LIGHT CHAIN) . 

FT DISULFID 28 79 

FT DISULFID 105 105 INTERCHAIN (WITH A HEAVY CHAIN). 

FT DISULFID 107 107 INTERCHAIN (WITH A HEAVY CHAIN) . 

FT DISULFID 110 110 INTERCHAIN (WITH A HEAVY CHAIN) . 

FT DISULFID 142 202 

FT CARBOHYD 178 178 N- LINKED (GLCNAC. . .). 

FT DISULFID 248 308 

SQ SEQUENCE 329 AA; 36074 MW; 5D231B7164D1FBA9 CRC64; 



Query Match 71.8%; Score 88 9; DB 1; Length 329; 

Best Local Similarity 72.3%; Pred. No. 2.3e-65; 

Matches 162; Conservative 24; Mismatches 36; Indels 2; Gaps 1; 



Qy 


6 


Db 


106 


Qy 


66 


JJD 


166 


Qy 


126 


Db 


226 


Qy 


184 


Db 


286 



Ml Mil M Mllhl Mill llllll II lllllllll - || |:| |:|| | 
TCPKCPPPENLGGPSVFIFPPKPKDTLMISLTPRWCVWDVSQDEPEVQFTWFVDNKPV 

HNAKTKPREEQYNSTYRWSVLTVLHQDWLNGKEYKCKVSNKALPAPI EKTI SKAKGQPR 

1 1 m ii i iiihhii in : iiiii iihiiii i ii r I I I I I | | | | | || || 

GNAETKPRVEQYNTTFRVESVLP I QHQDWLRGKEFKCKVYNKALPAP I EKTI SKTKGAPR 

EPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQP- -ENNYKTTPPVLDSDG 
I I I I I I I I I I I h I - I I : I I I : hhll ||| Mllll I |:|| 
MPDVYTLPPSRDELSKSKVSVTCLI INFFPADIHVEWASNRVPVSEKEYKNTPPIEDADG 

SFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPG 227 

hlllllllllll I II hMMMMIMI III 



RESULT 7 
GC3_MOUSE 

ID GC3_MOUSE STANDARD; PRT; 329 AA. 

AC P22436; 

DT 01 -AUG- 1991 (Rel. 19, Created) 

DT 01-AUG-1991 (Rel . 19, Last sequence update) 

DT 16-OCT-2001 (Rel. 40, Last annotation update) 

DE Ig gamma-3 chain C region, secreted form. 

OS Mus musculus (Mouse) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; 

OC Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; Mus. 

OX NCBI_TaxID=10090; 
RN [1] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=85027161; PubMed=6092 053 ; 

RA Wels J. A., Word C.J., Rimm D. , Der-Balan G.P., Martinez H.M. , 

RA Tucker P.W., Blattner F.R.; 

RT "Structural analysis of the murine IgG3 constant region gene."; 

RL EMBO J. 3:2041-2046(1984). 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib. ch) . 

CC 

DR EMBL; J00451; -; NOT_ANNOTATED_CDS . 

DR PIR; B02156; G3MSC . 

DR HSSP; P01857; 1FC1 . 

DR InterPro; IPR007110; Ig-like. 

DR InterPro; IPR003597; Ig_cl . 

DR InterPro; I PRO 03 006; Ig_MHC. 

DR Pfam; PF00047; ig; 3. 

DR SMART; SM004 07; IGcl; 2. 

DR PROSITE; PS50835; IG_LIKE; 3. 

DR PROSITE; PS00290; IG_MHC; 1. 

KW Immunoglobulin domain; Immunoglobulin C region; Glycoprotein; 



KW 


Transmembrane; Alternative splicing. 


FT 


NON_TER 


1 


1 




FT 


DOMAIN 


1 


97 


CHI. 


FT 


DOMAIN 


98 


113 


HINGE. 


FT 


DOMAIN 


114 


223 


CH2 . 


FT 


DOMAIN 


224 


327 


CH3 . 


SQ 


SEQUENCE 


329 AA; 


36228 


MW; F45827 



Query Match 68.3%; Score 845.5; DB 1; Length 329; 

Best Local Similarity 67.0%; Pred. No. 8.1e-62; 

Matches 150; Conservative 34; Mismatches 37; Indels 3; Gaps 1; 

Qy 8 PP CPAPELLGGPSVFLFPPKPKDTLMI SRTPEVTCWVDVSHEDPEVKFNWYVDGVE 64 

II II I I Ml TUN I I I I Ihlllllllll :|h| :|:|| I 

Db 106 PPGSSCPPGNILGGPSVFIFPPKPKDALMISLTPKVTCVWDVSEDDPDVHVSWFVDNKE 165 

Qy 65 WNAKTKPREEQYNSTYRWS VLT VLHQDWLNGKEYKCKVSNKALPAP I EKTI SKAKGQP 124 

M I hill 11111 = 1111 I = I I I I : llhlllhlMlllllhllll I 
Db 166 VHTAWTQPREAQYNSTFRWSALPIQHQDWMRGKEFKCKVNNKALPAPI ERTI SKPKGRA 225 

Qy 125 REPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGS 184 

_ : llllhll h:::| =1111111 h hill lh | :|| Mhllllh 
Db 226 QTPQVYT I P P PREQMS KKKVSLTCLVTNFFS EAI S VEWERNGELEQDYKNTP P I LDSDGT 285 

Qy 185 FFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 228 

■ I. ' I II ^ 'II , I I I I 

Db 286 YFLYSKLTVDTDSWLQGEIFTCSWHEALHNHHTQKNLSRSPGK 329 



RESULT 8 
GCB_RAT 

ID GCB__RAT STANDARD; PRT; 333 AA. 

AC P20761; 

DT 01-FEB-1991 (Rel. 17, Created) 

DT 01-FEB-1991 (Rel . 17, Last sequence update) 

DT 15-SEP-2003 (Rel. 42, Last annotation update) 

DE Ig gamma-2B chain C region. 

OS Rattus norvegicus (Rat) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; Rattus. 

OX NCB I _Tax ID=10116; 

RN [1] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=89232738; PubMed=3 14994 6 ; 

RA Brueggemann M . ; 

RT "Evolution of the rat immunoglobulin gamma heavy-chain gene family."; 

RL Gene 74:473-482(1988). 

CC -!- SIMILARITY: Contains 3 immunoglobulin-like domains. 

DR PIR; PS0018; PS0018 . 

DR HSSP; P01842; 7 FAB . 

DR InterPro; IPR007110; Ig-like. 

DR InterPro; IPR003597; Ig_cl. 

DR InterPro; IPR003 006; Ig_MHC. 

DR Pfam; PF00047; ig ; 3. 

DR SMART; SM00407; IGcl; 2. 

DR PROSITE; PS50835; IG_LIKE; 3. 

DR PROSITE; PS00290; IG_MHC; 1. 



KW 


Immunoglobulin domain; 


I mmunogl obul in C 


r eg ion; 


Repeat . 


FT 


MOM TPR 


X 


x 








FT 


UKJV'ln.A.v* 


r 
D 


yfo 


XLj^XjX xVci X . 






r i 


DOMAIN 


124 


223 


T f~1 T T T/tT" O 

1(j-L1Ke, z . 






r 1 


DOMAIN 


232 


328 


IG-LIKE 3. 






b I 


DISULFID 


15 


15 


INTERCHAIN 


(WITH A 


LIGHT CHAIN) 


FT 


DISULFID 


27 


80 








FT 


DISULFID 


106 


106 


INTERCHAIN 


(WITH A 


HEAVY CHAIN) 


FT 


DISULFID 


109 


109 


INTERCHAIN 


(WITH A 


TTT7 7\ T T\T /"ITT A T "NT \ 


FT 


DISULFID 


112 


112 


INTERCHAIN 


(WITH A 


HEAVY CHAIN) 


FT 


DISULFID 


115 


115 


INTERCHAIN 


(WITH A 


HEAVY CHAIN) 


FT 


DISULFID 


147 


207 








FT 


DISULFID 


253 


311 








SQ 


SEQUENCE 


333 AA; 


36497 MW; 55F8B64D48D460A6 


CRC64 ; 


Query Match 




67 


.7%; Score 838; 


DB 1; 


Length 333; 



Best Local Similarity 66.4%; Pred. No. 3.3e-61; 
Matches 148; Conservative 33; Mismatches 42; Indels 



0; Gaps 



0; 



Qy 


6 


Db 


111 


Qy 


66 


Db 


171 


Qy 


126 


Db 


231 


Qy 


186 


Db 


291 



M M lllllllllhlMIIII hl|: :|||llllll hhhhhh Ml 



I hhlllllMIMIII I : I ! I I . , i ' 



Ml = I ::|h I II I II II h II III III I II 



■ I M 



hllll 



I [(III lllh : I I : I Ml 



RESULT 9 
GC3M_MOUSE 

ID GC3M_M0USE STANDARD; PRT; 3 98 AA. 

AC P03987; 

DT 23-OCT-1986 (Rel . 02, Created) 

DT 01-AUG-1991 (Rel. 19, Last sequence update) 

DT 15-JUL-1999 (Rel. 38, Last annotation update) 

DE Ig gamma-3 chain C region, membrane -bound form. 

OS Mus musculus (Mouse) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; Mus. 

OX NCB I_TaxI D= 10090; 

RN [1] 

RP SEQUENCE FROM N . A. 

RX MEDLINE=85027161; PubMed=6092053 ; 

RA Wels J. A., Word C.J., Rimm D. , Der-Balan G.P., Martinez H.M., 

RA Tucker P.W. , Blattner F.R. ■ 

RT "Structural analysis of the murine IgG3 constant region gene."; 

RL EMBO J. 3:2041-2046(1984). 

RN [2] 

RP SEQUENCE OF 328-398 FROM N.A. 



RX MEDLINE=84041483; PubMed=6314258 ; 

RA Komaromy M. , Clayton L. , Rogers J., Robertson S., Kettman J., 

RA Wall R . ; 

RT "The structure of the mouse immunoglobulin in gamma 3 membrane gene 

RT segment . " ; 

RL Nucleic Acids Res. 11:6775-6785(1983). 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib. ch) . 

CC 

DR EMBL; J00451; AAB59655.1; -. 

DR EMBL; V01526; CAA24767.1; ALT_SEQ . 

DR PIR; A02156; G3MSM . 

DR HSSP; P01857; 1FC1 . 

DR InterPro; IPR007110; Ig-like. 

DR InterPro; IPR003597; Ig_cl . 

DR InterPro; IPR003 006; Ig_MHC. 

DR Pfam; PF00047; ig ; 3. 

DR SMART; SM00407; IGcl; 2. 

DR PROSITE; PS50835; IG_LIKE; 3. 

DR PROSITE; PS00290; IG_MHC; 1. 

Immunoglobulin domain; Immunoglobulin C region; Glycoprotein; 
Transmembrane; Alternative splicing. 

FT NON_TER 11 

FT DOMAIN 1 97 CHI. 

FT DOMAIN 98 113 HINGE. 

FT DOMAIN 114 223 CH2 . 

FT DOMAIN 224 327 CH3 . 

FT TRANSMEM 346 362 POTENTIAL. 

FT DOMAIN 363 3 98 CYTOPLASMIC (POTENTIAL) . 

FT CONFLICT 333 333 E -> G (IN REF . 2). 

FT CONFLICT 342 342 E -> Q (IN REF. 2) . 

FT CONFLICT 388 388 P -> F (IN REF. 2) . 

SQ SEQUENCE 398 AA; 43929 MW; CF7F264B50A41B95 CRC64 ; 

Query Match 67.4%; Score 834.5; DB 1; Length 398; 

Best Local Similarity 66.7%; Pred. No. 8e-61; 

Matches 148; Conservative 34; Mismatches 37; Indels 3; Gaps 1; 

QY 8 PP---CPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFNWYVDGVE 64 

II M : I I I I I I I : I I I I I I I I M I I I : I I I I I I I I I : I I : I :|:|| | 
Db 106 PPGSSCPPGNILGGPSVFIFPPKPKDALMISLTPKVTCVWDVSEDDPDVHVSWFVDNKE 165 

Qy 65 VHNAKTKPREEQYNSTYRWSVIjTVLHQDWLNGKEYKCKVSNKALPAPI EKTI SKAKGQP 124 

lfi M I 1 = 111 IN I hill I I : llih |||:| | | |:|||M Mlh I I II Ih 
Db 166 VHTAWTQPREAQYNSTFRWSALPIQHQDWMRGKEFKCKVNNKALPAPIERTISKPKGRA 225 

QY 125 REPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGS 184 

: M M I : I I I : : : : I I I II I I I h I : I I I I I I : | : | | | | | = M I II = ■ 
Db 226 QTPQVYTI PPPREQMSKKKVSLTCLVTNFFSEAI SVEWERNGELEQDYKNTPPILDSDGT 285 



KW 
KW 



Qy 



185 FFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSP 226 



MINIUM I II :hl I hllllll H I M I || 
Db 286 YFLYSKLTVDTDSWLQGEIFTCSWHEALHNHHTQKNLSRSP 327 



RESULT 10 








RAT 






t n 


GC1 RAT 


STANDARD; PRT; 326 AA. 




AC 


P20759; 






DT 


01-FEB-1991 (Rel. 17, Created) 




JJ 1 


01-FEB-1991 (Rel. 17, Last sequence update) 




U 1 


15-JUL-1999 (Rel. 38, Last annotation update) 




UjCj 


Ig gamma - 


1 chain C region. 




nc 

Uo 


Rattus norvegicus (Rat) . 




or 1 


Eukaryota; Metazoa; Chorda ta; Craniata; Vertebrata; Euteleostomi ; 




Mammalia; 


Eutheria; Rodent ia; Sciurognathi ; Muridae; Murinae; Rattus 


nv 

vJA 


NCBI TaxID=10116; 




KIM 


[1] 






Kr 


SEQUENCE 


FROM N.A. 




DV 

KA 


MEDLINE=8 9232738; PubMed=3 14 9946 ; 




RA 


Brueggemann M. ; 






"Evolution of the rat immunoglobulin gamma heavy-chain gene family." 


DT 
KJ_i 


Gene 74:473-482(1988). 




UK 


PIR; PS0017; PS0017. 




UK 


HSSP; P01842; 7FAB . 




JJK 


InterPro; 


IPR007110; Ig-like. 




UK 


InterPro; 


IPR003597; Ig_cl . 




DR 


InterPro; 


IPR003006; Ig MHC. 




DR 


Pfam; PF00047; ig ; 3. 




DR 


SMART; SM00407; IGcl; 2. 




np 
iJi\ 


PROSITE; 


PS50835; IG_LIKE; 3. 




UK 


PROSITE; 


PS00290; IG MHC; 1. 




xVvv 


Immunoglobulin domain; Immunoglobulin C region; 


Glycoprotein. 


FT 


NON_TER 


1 1 




FT 


DOMAIN 


1 97 CHI. 




FT 


DOMAIN 


98 112 HINGE. 




FT 


DOMAIN 


113 219 CH2 . 




FT 


DOMAIN 


220 326 CH3 . 




FT 


DISULFID 


27 82 




FT 


DISULFID 


102 102 INTERCHAIN (WITH A 


HEAVY CHAIN) . 


FT 


DISULFID 


106 106 INTERCHAIN (WITH A 


HEAVY CHAIN) . 


FT 


DISULFID 


109 109 INTERCHAIN (WITH A 


HEAVY CHAIN) . 


FT 


DISULFID 


111 111 INTERCHAIN (WITH A 


HEAVY CHAIN) . 


FT 


DISULFID 


140 200 




FT 


DISULFID 


246 304 




FT 


CARBOHYD 


176 176 N-LINKED (GLCNAC. . 


. ) (POTENTIAL) . 


SQ 


SEQUENCE 


326 AA; 35946 MW; 013BAB45EF49B9DA 


CRC64 ; 


Query Match 


66.1%; Score 818.5; DB 1; 


Length 326; 



Best Local Similarity 63.1%; Pred. No. 1.3e-59; 
Matches 142; Conservative 41; Mismatches 35; Indels 7; Gaps 2; 

Qy 7 CPPCPAPELLGG PSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFNWYVDGV 63 

I M =1 llhlllllll I h Ihllllllhl h III I 

Db 106 CKPC ICTGSEVSSVFIFPPKPKDVLTITLTPKVTCVVVDISQDDPEVHFSWFVDDV 161 

QY 64 EVHNAKTKPREEQYNSTYRWSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKAKGQ 123 

Ml ' ,1 I ' I || I :|||||||h -llh: I hllllllll ' 



Db 162 EVHTAQTRPPEEQFNSTFRSVSELPILHQDWLNGRTFRCKVTSAAFPSPIEKTISKPEGR 221 



Qy 124 PREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDG 183 

: I III: h^hhhlhlhllllll II llh ||||: ||| I 
Db 222 TQVPHVYTMSPTKEEMTQNEVSITCMVKGFYPPDIYVEWQMNGQPQENYKNTPPTMDTDG 281 

Qy 184 SFFLYSKLTVDKSRWQQGNVPSCSVMHEALHNHYTQKSLSLSPGK 228 

hllllll I I Mill I |III = | = IMI I I I I 

Db 282 SYFLYSKLNVKKEKWQQGNTFTCSVLHEGLHNHHTEKSLSHSPGK 326 



RESULT 11 
GC1_M0USE 

ID GC1_M0USE STANDARD; PRT; 324 AA. 

AC P01868; 

DT 21-JUL-1986 (Rel . 01, Created) 

DT 21-JUL-1986 (Rel. 01, Last sequence update) 

DT 15-SEP-2003 (Rel. 42, Last annotation update) 

DE Ig gamma -1 chain C region secreted form. 

OS Mus mus cuius (Mouse) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; Mus. 

OX NCBI_TaxID-10090; . 

RN [1] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=80045036; PubMed=115593 ; 

RA Honjo T., Obata M. , Yamawaki-Kataoka Y., Kataoka T., Kawakami T. , 

RA Takahashi N. , Mano Y. ; 

RT "Cloning and complete nucleotide sequence of mouse immunoglobulin 

RT gamma 1 chain gene." ; 

RL Cell 18:559-568(1979). 

RN [2] 

RP SEQUENCE OF 76-324 FROM N.A. (MYELOMA PROTEIN MOPC 31C) . 

RX MEDLINE=80202559; PubMed=6769752 ; 

RA Obata M. , Yamawaki-Kataoka Y. , Takahashi N . , Kataoka T. , Shimizu A., 

RA Mano Y. , Seidman J.G. , Peterlin B.M., Leder P., Honjo T. ,- 

RT "Immunoglobulin gamma 1 heavy chain gene: structural gene sequences 

RT cloned in a bacterial plasmid."; 

RL Gene 9:87-97 (1980) . 

RN [3] 

RP SEQUENCE OF 70-322 FROM N.A. (MYELOMA PROTEIN MOPC 21) . 

RX MEDLINE=80012837; PubMed=113776 ; 

RA Rogers J., Clarke P., Salser W.; 

RT "Sequence analysis of cloned cDNA encoding part of an immunoglobulin 

RT heavy chain. " ; 

RL Nucleic Acids Res. 6:3305-3321(1979). 

RN [4] 

RP SEQUENCE (MYELOMA PROTEIN MOPC 21) . 

RX MEDLINE=78242288; PubMed=98524 ; 

RA Adetugbo K. ; 

RT "Evolution of immunoglobulin subclasses. Primary structure of a 

RT murine myeloma gamma 1 chain."; 

RL J. Biol. Chem. 253:6068-6075(1978). 

RN [5] 

RP DISULFIDE BONDS (MOPC 21) . 

RX MEDLINE=73008889; PubMed=5073237 ; 

RA Svasti J., Milstein C. ; 



RT "The disulphide bridges of a mouse immunoglobulin Gl protein."; 

RL Biochem. J. 126:837-850(1972). 

CC -!- SUBCELLULAR LOCATION: Secreted. 

CC - ! - ALTERNATIVE PRODUCTS : 

CC Event =Alternative splicing; Named isoforms-2; 

CC Name=Secreted; 

CC IsoId=P01868-l; Sequence=Displayed; 

CC Note=May be the major isoform; 

CC Name =Membr ane - bound ; 

CC IsoId=P01869-l; Sequence=External ; 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL out station - 

CC the European Bioinformatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib . ch) . 

CC 

OR EMBL; V00793; CAA24172.1; -. 

DR EMBL; V00793; CAA24173.1; -. 

DR EMBL; V00793 ; CAA24174.1; -. 

DR EMBL; V00793; CAA24175.1; -. 

DR EMBL; V00795; CAA24176.1; -. 

DR PIR; A02159; G1MS . 

DR PDB; 1IGC; 03-JUN-95. 

DR GlycoSuiteDB; P01868; -. 

DR MGD; MGI: 96446; Igh-4 . 

DR InterPro; IPR007110; Ig-like. 

DR InterPro; IPR003597; Ig_cl . 

DR InterPro; IPR003006; Ig_MHC. 

DR Pfam; PF00047; ig ; 3. 

DR SMART; SM00407; IGcl; 2. 

DR PROSITE; PS50835; IG_LIKE; 3. 

DR PROSITE; PS00290; IG_MHC; 1. 

KW Immunoglobulin domain; Immunoglobulin C region; Glycoprotein,- 

KW Alternative splicing; 3D-structure . 

FT NONJTER 1 1 

FT DOMAIN 1 97 CHI. 

FT DOMAIN 98 110 HINGE. 

FT DOMAIN 111 217 CH2 . 

FT DOMAIN 218 324 CH3 . 

FT DISULFID 27 82 

FT DISULFID 102 102 INTERCHAIN (WITH A LIGHT CHAIN) . 

FT DISULFID 104 104 INTERCHAIN (WITH A HEAVY CHAIN) . 

FT DISULFID 107 107 INTERCHAIN (WITH A HEAVY CHAIN) . 

FT DISULFID 109 109 INTERCHAIN (WITH A HEAVY CHAIN) . 

FT DISULFID 138 198 

FT CARBOHYD 174 174 N-LINKED ( GLCNAC . . .). 

FT /FTId=CAR__000055 . 

FT DISULFID 244 302 

FT MOD_RES 324 324 REMOVED POST-TRANSLATIONALLY . 

FT CONFLICT 276 276 N -> D (IN REF . 3). 

FT CONFLICT 278 278 N -> D (IN REF. 3). 

SQ SEQUENCE 324 AA; 35704 MW; A338812F3D1F2C93 CRC64 ; 



Query Match 



66.0%; Score 817.5; DB 1; Length 324; 



Best Local Similarity 62.5%; Pred. No. 1.5e-59; 

Matches 140; Conservative 44; Mismatches 35; Indels 5; Gaps 2; 



Qy 7 CPP--CPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVWDVSHEDPEVKFNWYVDGVE 64 

I I I Ih IMIIIII | h Ihllllllhl :|||hhhll II 

Db 104 CKPCICTVPEV SSVFIFPPKPKDVLTITLTPKVTCVVVDISKDDPEVQFSWFVDDVE 160 

Qy 65 VT1NAKTKPREEQYNSTYRWS VIjT VTjHQDWLNGKEYKCKVSNKALPAPI EKTI SKAKGQP 124 

II hhllllhllhl II I ::|lllllllhlhh: I I I I I I j I I f I Ihl 

Db 161 VHTAQTQPREEQFNSTFRSVSELPIMHQDWLNGKEFKCRVNSAAFPAPIEKTISKTKGRP 220 

Qy 125 REPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGS 184 



* • ■ * * * * i * * I * ■ M I I I II I l II 

Db 221 KAPQVYTIPPPKEQMAKDKVSLTCMITDFFPEDITVEWQWNGQPAENYKNTQPIMNTNGS 28 0 

Qy 185 FFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 228 

I I II h II hllhll lllhhilll MM 

Db 281 YFWSKLNVQKSNWEAGNTFTCSVLHEGLHNHHTEKSLSHSPGK 324 

RESULT 12 
GC1M_M0USE 

ID GC1M_M0USE STANDARD; PRT; 393 AA. 

AC P01869; 

DT 21-JUL-1986 (Rel . 01, Created) 

DT 01-AUG-1991 (Rel. 19, Last sequence update) 

DT 15-SEP-2003 (Rel. 42, Last annotation update) 

DE Ig gamma -1 chain C region, membrane -bound form. 

OS Mus mus cuius (Mouse) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; Mus. 

OX NCBI_TaxID=10090; 

RN [1] 

RP SEQUENCE FROM N.A. 

RX MEDLINE-80045036; PubMed=115593 ; 

RA Honjo T., Obata M. , Yamawaki-Kataoka Y. , Kataoka T., Kawakami T. , 

RA Takahashi N., Mano Y. ; 

RT "Cloning and complete nucleotide sequence of mouse immunoglobulin 

RT gamma 1 chain gene." ; 

RL Cell 18:559-568(1979). 

RN [2] 

RP SEQUENCE OF 323-3 93 FROM N.A. 

RX MEDLINE=82197626; PubMed=68 04 950 ; 

RA Tyler B.M., Cowman A.F., Gerondakis S.D., Adams J.M., Bernard 0. ; 

RT "mRNA for surface immunoglobulin gamma chains encodes a highly 

RT conserved transmembrane sequence and a 28-residue intracellular 

RT domain. 11 ; 

RL Proc. Natl. Acad. Sci . U.S.A. 79:2008-2012(1982). 

RN [3] 

RP SEQUENCE OF 323-366 FROM N.A. 

RX MEDLINE=82115295; PubMed=6799207 ; 

RA Rogers J., Choi E. , Souza L. , Carter C. , Word C.J., Kuehl M. , 

RA Eisenberg D. , Wall R. ; 

RT "Gene segments encoding transmembrane carboxyl termini of 

RT immunoglobulin gamma chains."; 

RL Cell 26:19-27(1981) . 

RN [4] 



RP . SEQUENCE OF 1-44 FROM N . A. 

RX MEDLINE=82222190; PubMed=628353 7 ; 

RA Yamawaki-Kataoka Y., Nakai S., Miyata T. , Honjo T.; 

RT "Nucleotide sequences of gene segments encoding membrane domains of 

RT immunoglobulin gamma chains. "; 

RL Proc. Natl. Acad. Sci. U.S.A. 79:2623-2627(1982). 

CC - ! - ALTERNATIVE PRODUCTS : 

CC Event-Alternative splicing; Named isoforms=2; 

CC Name=Membrane -bound ; 

CC IsoId=P01869-l; Sequence=Displayed; 

CC Name=Secreted; 

CC IsoId=P01868-l; Sequence^ External ; 

CC Note=May. be the major isoform; 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinformatics and the EMBL out station - 

CC the European Bioinformatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 
CC , or send an email to license@isb-sib. ch) . 

CC 

DR EMBL; V00793; CAA24172.1; 

DR EMBL; V00793; CAA24173.1; -. 

DR EMBL; V00793; CAA24174.1; 

DR PIR; B02159; G1MSM . 

DR PDB; 15C8; 23-MAR-99. 

DR PDB; 1AE6; 18-MAR-98. 

DR PDB; 1CL7; 12-JAN-OO. 

DR PDB; 1F11; 06-FEB-01. 

DR PDB; 1F58; 29-DEC-99. 

DR PDB ; 1KC5; 24-JUL-02. 

DR PDB; 1KCR; ll-MAY-02. 

DR PDB; 25C8; 09-JUL-99. 

DR MGD; MGI: 96446; Igh-4 . 

DR InterPro; IPR007110; Ig-like. 

DR InterPro; IPR003597; Ig_cl. 

DR InterPro; IPR003006; Ig_MHC. 

DR Pfam; PF00047; ig ; 3. 

DR SMART; SM00407; IGcl; 2. 

DR PROSITE; PS50835; IG_LIKE; 3. 

DR PROSITE; PS00290; IG_MHC; 1. 

KW Immunoglobulin domain; Immunoglobulin C region; Glycoprotein; 



KW 


Alternative 


splicing; 


Transmembrane ; 3 D - s t rue ture . 




FT 


NON_TER 


1 


1 






FT 


DOMAIN 


1 


97 


CHI. 




FT 


DOMAIN 


98 


110 


HINGE. 




FT 


DOMAIN 


111 


217 


CH2 . 




FT 


DOMAIN 


218 


324 


CH3. 




FT 


DISULFID 


27 


82 






FT 


DISULFID 


102 


102 


INTERCHAIN (WITH A LIGHT 


CHAIN) 


FT 


DISULFID 


104 


104 


INTERCHAIN (WITH A HEAVY 


CHAIN) 


FT 


DISULFID 


107 


107 


INTERCHAIN (WITH A HEAVY 


CHAIN) 


FT 


DISULFID 


109 


109 


INTERCHAIN (WITH A HEAVY 


CHAIN) 


FT 


DISULFID 


138 


198 






FT 


CARBOHYD 


174 


174 


N- LINKED (GLCNAC. . .). 




FT 


DISULFID 


244 


302 







FT TRANSMEM 34 0 357 POTENTIAL. 

FT DOMAIN 358 393 CYTOPLASMIC (POTENTIAL) . 

SQ SEQUENCE 393 AA; 43386 MW; 4CC88343B7A1CE27 CRC64 ; 

Query Match 65.6%; Score 812.5; DB 1; Length 393; 

Best Local Similarity 62.3%; Pred. No. 4.9e-59; 

Matches 139; Conservative 44; Mismatches 35; Indels 5; Gaps 2; 
Qy 7 CPP--CPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFNWyVDGVE 64 

II | ||: 1 1 h 1 1 1 1 1 1 1 I h I h 1 1 1 1 1 1 1 : 1 = 1 1 1 h I • I • I I II 

D b 104 CKPCICTVPEV---SSVFIFPPKPKDVLTITLTPKVTCVVVDISKDDPEVQFSWFVDDVE 160 

QV 65 VHNAKTKPREEQYNSTYRWSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKAKGQP 124 

|| hhllllhllhl II I -I'. , ' i iM I IIIIMIIII M l 

Db 161 VHTAQTQPREEQFNSTFRSVSELPIMHQDWLNGKEFKCRVNSAAFPAPIEKTISKTKGRP 220 

Oy 125 REPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGS 184 

: IIMhll :::: |::|||||:: I = I I I I I I = I I I I I I I ' I I : : : : : I I 

Db 221 KAPQVYTIPPPKEQMAKDKVSLTCMITDFFPEDITVEWQWNGQPAENYKNTQPIMNTNGS 280 

Qy 185 FFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPG 227 

' h | II h II MINI 11,1.1' III 
Db 281 YFVYSKLNVQKSNWEAGNTFTCSVLHEGLHNHHTEKSLSHSPG 323 



RESULT 13 
GCC_RAT 

ID GCC_RAT STANDARD; PRT; 32 9 AA. 

AC P20762; 

DT 01-FEB-1991 (Rel . 17, Created) 

DT 01-FEB-1991 (Rel. 17, Last sequence update) 

DT 15-JUL-1999 (Rel. 38, Last annotation update) 

DE Ig gamma-2C chain C region. 

OS Rattus norvegicus (Rat) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Rodent ia; Sciurognathi ; Muridae; Murinae; Rattus. 

OX NCBI_TaxID=10116; 

RN [1] 

RP SEQUENCE FROM N . A. 

RX MEDLINE=88166903; PubMed=3 127222 ; 

RA Brueggemann M., Delmastro-Galf re P., Waldmann H. , Calabi F. ; 

RT "Sequence of a rat immunoglobulin gamma 2c heavy chain constant 

RT region cDNA: extensive homology to mouse gamma 3."; 

RL Eur. J. Immunol. 18:317-319(1988). 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinformatics and the EMBL outstation - 

CC the European Bioinformatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib . ch) . 

CC 

DR EMBL; X07189; CAA3 0169 . 1 ; - . 

DR PIR; S00847; S00847. 

DR HSSP; P01842; 7FAB . 

DR InterPro; IPR007110; Ig-like. 



DR 


InterPro; 


IPR003597; Ig 


cl. 






DR 


InterPro; 


IPR003006; Ig 


_MHC. 






DR 


Pfam; PF00047; ig; 


2. 








DR 


SMART; SM00407; IGcl; 2 








DR 


PROSITE; 


PS50835; 


IG_LIKE; 3. 






DR 


PROSITE ; 


PS00290; 


IG_MHC; 1. 






KW 


Immunoglobulin domain; 


Immunoglobulin C region. 




FT 


NON_TER 


1 


1 








FT 


DOMAIN 


1 


97 




CHI. 




FT 


DOMAIN 


98 


113 




HINGE. 




FT 


DOMAIN 


114 


222 




CH2 . 




FT 


DOMAIN 


223 


329 




CH3. 




FT 


DISULFID 


15 


15 




INTERCHAIN (WITH A 


LIGHT CHAIN) . 


FT 


DISULFID 


27 


82 








FT 


DISULFID 


111 


111 




INTERCHAIN (WITH A 


HEAVY CHAIN) . 


FT 


DISULFID 


113 


113 




INTERCHAIN (WITH A 


HEAVY CHAIN) . 


FT 


DISULFID 


143 


203 








FT 


DISULFID 


249 


307 








SQ 


SEQUENCE 


329 AA; 


36571 MW, 


; 5FCD7B7933850773 


CRC64 ; 


Query Match 




65 


.4%; 


Score 809.5; DB 1; 


Length 32 9; 


Best Local Similarity 63 


. 8 % ; 


Pred. No. 6.9e-59; 





Matches 143; Conservative 41; Mismatches 37; Indels 3; Gaps 



1; 



Qy 


8 


Db 


106 


Qy 


65 


Db 


166 


Qy 


125 


Db 


226 


Qy 


185 


Db 


286 



II I : II I I Ilh Ihlllllllll hhhhhl 



hi Ml | hill I I : . I I — ' I : I I I ' : I I I I I I I I I I 1 I =1 



REPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGS 

I Mllhll I ^ I I ! • Ill- Mill I I • I : I I I I I I I I I I 

RTPQVYTIPPPREQMSKNKVSLTCMVTSFYPASISVEWERNGELEQDYKNTLPVLDSDES 

FFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 228 

:||||||:|| I :|::::|||:|IMIIhllhll I I I I 
YFLYSKLSVDTDSWMRGDI YTCSWHEALHNHHTQKNLSRSPGK 329 



184 



285 



RESULT 14 
GCAA_MOUSE 

ID GCAA_MOUSE STANDARD ; PRT; 33 0 AA. 

AC P01863; 

DT 21-JUL-1986 (Rel . 01, Created) 

DT 21-JUL-1986 (Rel. 01, Last sequence update) 

DT 15-SEP-2003 (Rel. 42, Last annotation update) 

DE Ig gamma-2A chain C region, A allele. 

OS Mus musculus (Mouse) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; Mus. 

OX NCBI_TaxID=10090; 
RN [1] 

RP SEQUENCE FROM N . A. 

RX MEDLINE=81076554; PubMed=6777755 ; 

RA Sikorav J.-L., Auffray C. , Rougeon F . ; 



RT "Structure of the constant and 3 1 untranslated regions of the murine 

RT Balb/c gamma 2a heavy chain messenger RNA . " ; 

RL Nucleic Acids Res. 8:3143-3155(1980). 

RN [2] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=81198976; PubMed=6262729 ; 

RA Yamawaki-Kataoka Y., Miyata T. , Honjo T. ; 

RT "The complete nucleotide sequence of mouse immunoglobin gamma 2a gene 

RT and evolution of heavy chain genes: further evidence for intervening 

RT sequence-mediated domain transfer. "; 

RL Nucleic Acids Res. 9:1365-1381(1981). 

RN [3] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=81223894; PubMed-6787604 ; 

RA Olio R., Auffray C, Morchamps C. , Rougeon F . ; 

RT "Comparison of mouse immunoglobulin gamma 2a and gamma 2b chain genes 

RT suggests that exons can be exchanged between genes in a multigenic 

RT family."; 

RL Proc. Natl. Acad. Sci. U.S.A. 78:2442-2446(1981). 

RN [4] 

RP MYELOMA PROTEIN MOPC 173. 

RX MEDLINE=74175517; PubMed=4831970 ; 

RA Bourgois A., Fougereau M. , Rocca-Serra J.; 

RT "Determination of the primary structure of a mouse IgG2a 

RT immunoglobulin: amino -acid sequence of the Fc fragment. Implications 

RT for the evolution of immunoglobulin structure and function."; 

RL Eur. J. Biochem. 43:423-435(1974). 

RN [5] 

RP DISULFIDE BONDS. 

RX MEDLINE=73056887; PubMed=45654 06 ; 

RA de Preval C. , Fougereau M. 

RT "Determination of the primary structure of a mouse gamma G2a 

RT immunoglobulin. Identification of the disulfide bridges ." ; 

RL Eur. J. Biochem. 30:452-462(1972). 

CC -!- SIMILARITY: Contains 3 immunoglobulin- like domains. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib. ch) . 

CC 

DR EMBL; V00798; CAA24178.1; 

DR PIR; A02152; G2MSA. 

DR PDB; 1E4W; 12-JUL-01. 

DR PDB; 1E4X; 12-JUL-01. 

DR PDB; 1MNU; 06-MAY-99. 

DR InterPro; IPR007110; Ig-like. 

DR InterPro; IPR003597; Ig_cl . 

DR InterPro; IPR003006; Ig_MHC. 

DR Pfam; PF00047; ig ; 2. 

DR SMART; SM004 07; IGcl; 2. 

DR PROSITE; PS50835; IG_LIKE; 3. 

DR PROSITE; PS00290; IGJVIHC; 1. 

KW Immunoglobulin domain; Immunoglobulin C region; 3D-structure; Repeat. 



FT 


NON_TER 


1 


1 






FT 


DOMAIN 


6 


98 


IG-LIKE 1. 




FT 


DOMAIN 


121 


220 


IG-LIKE 2. 




FT 


DOMAIN 


229 


325 


IG-LIKE 3. 




FT 


DISULFID 


15 


15 


INTERCHAIN 


(WITH A LIGHT CHAIN) 


FT 


DISULFID 


27 


82 






FT 


DISULFID 


107 


107 


INTERCHAIN 


(WITH A HEAVY CHAIN) 


FT 


DISULFID 


110 


110 


INTERCHAIN 


(WITH A HEAVY CHAIN) 


FT 


DISULFID 


112 


112 


INTERCHAIN 


(WITH A HEAVY CHAIN) 


FT 


DISULFID 


144 


204 






FT 


DISULFID 


250 


308 






FT 


MOD_RES 


330 


330 


REMOVED POST-TRANSLATIONALLY. 


SQ 


SEQUENCE 


330 AA; 


36389 MW; 


B84361C5445A6864 CRC64 ; 


Query Match 




65.3%; 


Score 8 09; 


DB 1; Length 330; 



Best Local Similarity 65.2%; Pred. No. 7.6e-59; 

Matches 146; Conservative 30; Mismatches 46; Indels 2; Gaps 1; 



Qy 7 CPP--CPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVWDVSHEDPEVKFNWYVDGVE 64 

III II 1 1 IIIIIIIIMIII II 1 1 1 1 I MINIMI Mhh MM: M 

Db 107 CPPCKCPAPNLLGGPSVFIFPPKIKDVljMISLSPIVTCVVVDVSEDDPDVQISWFVNNVE 166 

Qy 65 VHNAKTKPREEQYNSTYRWSVLTVLHQDWLNGKEYKCKVSNKALPAPI EKTI SKAKGQP 124 

II I = h I MM MM I I II I : M II M 1 1 1 M I I II II I M 1 1 1 II 

Db 167 VHTAQTQTHREDYNSTLRWSALPI QHQDWMSGKEFKCKVNNKDLPAPI ERTI SKPKGSV 226 

Qy 125 REPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGS 184 

I MM III MMI IIMIhl I I II III Mh I III I 1 1 1 1 1 1 i I 

Db 227 RAPQVYVLPPPEEEMTKKQVTLTCMVTDFMPEDIYVEWTNNGKTELNYKNTEPVLDSDGS 286 

Qy 185 FFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 228 

■llli I : I MIIIMI MUM M I MM 

Db 287 YFMYSKLRVEKKNWVERNSYSCSWHEGLHNHHTTKSFSRTPGK 33 0 

RESULT 15 
GCAM_MOUSE 

ID GCAM_MOUSE STANDARD; PRT; 3 99 AA. 

AC P01865; 

DT 21-JUL-1986 (Rel . 01, Created) 

DT 01-AUG-1991 (Rel. 19, Last sequence update) 

DT 15-SEP-2003 (Rel. 42, Last annotation update) 

DE Ig gamma -2A chain C region, membrane -bound form. 

OS Mus musculus (Mouse) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; Mus. 

OX NCBI__TaxID=10090; 

RN [1] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=82222190; PubMed=6283537 ; 

RA Yamawaki-Kataoka Y. , Nakai S., Miyata T. , Honjo T. ; 

RT "Nucleotide sequences of gene segments encoding membrane domains of 

RT immunoglobulin gamma chains."; 

RL Proc. Natl. Acad. Sci. U.S.A. 79:2623-2627(1982). 

CC -!- ALTERNATIVE PRODUCTS: 

CC Event ^Alternative splicing; Named isoforms=2; 

CC Name=Membrane -bound; 



CC IsoId=P01865-l; Sequence=Displayed; 

CC Name=Secreted; 

CC I soId=P018 64 - 1 ; Sequence-External ; 

CC Note=Probably the major isoform; 

CC -!- SIMILARITY: Contains 3 immunoglobulin- like domains. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib. ch) . 

CC 

DR EMBL; J00471; AAB59661.1; ALT_INIT. 

DR PIR; A02154; G2MSAM. 

DR PDB; 1KB5; 08 -APR- 98. 

DR PDB; 1YEE; 15 -OCT- 97. 

DR MGD; MGI: 96443; Igh-1. 

DR InterPro; IPR007110; Ig-like. 

DR InterPro; IPR003597; Ig_cl. 

DR InterPro; IPR003 006; Ig_MHC. 

DR Pfam; PF00047; ig ; 2. 

DR SMART; SM00407; IGcl; 2. 

DR. PROSITE; PS50835; IG_LIKE; 3. 

DR PROSITE; PS00290; IG_MHC; 1. 

KW Immunoglobulin domain; Immunoglobulin C region; Glycoprotein ; 

KW Transmembrane; Alternative splicing; 3D-structure; Repeat. 



FT 


NON__TER 


1 


1 










FT 


DOMAIN 


6 


98 




IG-LIKE 1. 






FT 


DOMAIN 


121 


220 




IG-LIKE 2. 






FT 


DOMAIN 


229 


325 




IG-LIKE 3. 






FT 


DISULFID 


15 


15 




INTERCHAIN 


(WITH A 


LIGHT CHAIN) . 


FT 


DISULFID 


27 


82 










FT 


DISULFID 


107 


107 




INTERCHAIN 


(WITH A 


HEAVY CHAIN) . 


FT 


DISULFID 


110 


110 




INTERCHAIN 


(WITH A 


HEAVY CHAIN) . 


FT 


DISULFID 


112 


112 




INTERCHAIN 


(WITH A 


HEAVY CHAIN) . 


FT 


DISULFID 


144 


204 










FT 


DISULFID 


250 


308 










FT 


TRANSMEM 


346 


363 




POTENTIAL . 






FT 


DOMAIN 


364 


399 




CYTOPLASMIC 


(POTENTIAL) . 


FT 


CARBOHYD 


180 


180 




N- LINKED (GLCNAC. . 


. .) (POTENTIAL) 


SQ 


SEQUENCE 


399 AA; 


44020 MW; 


4C38138BFAED3FF0 


CRC64 ; 


Query Match 




64. 


.9%; 


Score 804; 


DB 1; 


Length 3 99; 


Best Local Similarity 


65. 


.0%; 


Pred . No . 2 


.5e-58; 





Matches 145; Conservative 30; Mismatches 46; Indels 



Gaps 



1; 



Qy 
Db 

Qy 
Db 



7 CPP- -CPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFNWYVDGVE 64 

III 1 1 1 1 IIIIIIMII II 1 1 1 1 I MINIM MMM :hh 1 1 

107 CPPCKCPAPNLLGGPSVFIFPPKIKDVLMISLSPIVTCVVVDVSEDDPDVQISWFVNNVE 166 
65 VHNAKTKPREEQYNSTYRWSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKAKGQP 124 

II hh I 1 1 1 1 1 1 1 1 I : MMMMMMMM lllllhllll II 

167 VHTAQTQTHREDYNSTLRWSALP I QHQDWMSGKEFKCKVNNKDLPAP I ERTI SKPKGSV 226 



Qy 125 REPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGS 184 



I I I I I III :hll Ihllhl I I II III Mh I Ml I MINIM 
Db 227 RAPQWVLPPPEEEMTKKQVTLTCMVTDFMPEDIYVEWTNNGKTELNYKNTEPVLDSDGS 286 

Qy 185 FFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPG 227 

MMIM hi I = I Mllhll Mlhl II I Ml 

Db 287 YFMYSKLRVEKKNWVERNSYSCSWHEGLHNHHTTKSFSRTPG 329 

Search completed: January 16, 2 004, 15:57:37 
Job time : 19 sees 



GenCore version 5.1.6 
Copyright (c) 1993 - 2004 Compugen Ltd. 



OM protein - protein search, using sw model 
Run on: 



Title: 

Perfect score: 
Sequence : 

Scoring table: 



January 16, 2 004, 15:54:50 ; Search time 35 Seconds 

(without alignments) 
1681.028 Million cell updates/sec 

US-09-843-221A-2 
1238 

1 MDKTHTCPPCPAPELLGGPS MHEALHNHYTQKSLSLS PGK 228 

BLOSUM62 

Gapop 10.0 , Gapext 0.5 



Searched: 830525 seqs, 258052604 residues 

Total number of hits satisfying chosen parameters: 



830525 



Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 



Database : SPTREMBL 23:* 



1 




sp_archea : * 


2 




sp_bacteria : * 


3 




sp_f ungi : * 


4 




sp_human : * 


5 




sp invertebrate : * 


6 




sp mammal : * 


7 




sp_mhc : * 


8 




sp_organelle: * 


9 




sp__phage : * 


10: 


sp_plant : * 


11: 


sp_rodent : * 


12: 


sp_virus : * 


13: 


sp_vertebrate : * 


14: 


sp_unclassif ied: * 


15: 


sp_rvirus : * 


16: 


sp__bacteriap : * 


17: 


sp_ar cheap : * 



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 



SUMMARIES 

% 

Result Query 

No. Score Match Length DB ID Description 



1 


1233 


99 


.6 


471 


4 


Q8TC77 


Q8tc77 homo 


sapien 


2 


1233 


99 


.6 


701 


4 


Q96PQ8 


Q96pq8 homo 


sapien 


3 


1142 


92 


.2 


521 


4 


Q8N4Y9 


Q8n4y9 homo 


sapien 


4 


1135 


91 


.7 


473 


4 


Q8TC63 


Q8tc63 homo 


sapien 


5 


1131 


91 


.4 


509 


4 


Q8NF17 


Q8nfl7 homo 


sapien 


6 


909 


73 


.4 


337 


6 


Q95M34 


Q95m34 equus cabal 


7 


831.5 


67 


.2 


469 


11 


Q8R3V9 


Q8r3v9 mus 


musculu 


8 


828 .5 


66, 


.9 


463 


11 


Q99LC4 


Q991c4 mus 


musculu 


9 


824.5 


66 


.6 


437 


11 


Q9R1A4 


Q9rla4 mus 


musculu 


10 


811 


65 


.5 


473 


11 


Q9D8L4 


Q9d8l4 mus 


musculu 


11 


805 


65 


.0 


468 


11 


Q99L31 


Q99131 mus 


musculu 


12 


805 


65 


.0 


473 


11 


Q99L25 


Q99125 mus 


musculu 


13 


780 


63, 


. 0 


473 


11 


Q91205 


Q91z05 mus 


musculu 


14 


780 


63 


.0 


474 


11 


Q8R3H6 


Q8r3h6 mus 


musculu 


15 


358 


28, 


.9 


375 


4 


Q9BS21 


Q9bszl homo 


sapien 


16 


358 


28 


.9 


597 


4 


Q9BU10 


Q9bul0 homo 


sapien 


17 


358 


28 , 


.9 


597 


4 


Q9BQB8 


Q9bqb8 homo 


sapien 


18 


358 


28 


.9 


597 


4 


Q96BB9 


Q96bb9 homo 


sapien 


19 


348 


28 , 


.1 


588 


4 


Q8WUX4 


Q8wux4 homo 


sapien 


20 


348 


28. 


.1 


613 


4 


Q96EY0 


Q96ey0 homo 


sapien 


21 


348 


28, 


.1 


613 


4 


Q8WUK1 


Q8wukl homo 


sapien 


22 


348 


28, 


. 1 


614 


4 


Q96GA6 


Q96ga6 homo 


sapien 


23 


348 


28, 


.1 


618 


4 


Q96AA6 


Q96aa6 homo 


sapien 


24 


347 


28, 


.0 


613 


11 


Q8VCX7 


Q8vcx7 mus 


musculu 


25 


272.5 


22 


.0 


. 684 


13 


Q90544 


Q90544 ginglymosto 


26 


272 


22, 


.0 


384 


4 


Q9UP60 


Q9up60 homo 


sapien 


27 


272 


22. 


. 0 


493 


4 


Q8NCL6 


Q8ncl6 homo 


sapien 


28 


272 


22, 


.0 


494 


4 


Q96K68 


Q96k68 homo 


sapien 


29 


272 


22. 


.0 


496 


4 


Q96KX8 


Q96kx8 homo 


sapien 


30 


269 


21, 


.7 


496 


4 


Q96DK0 


Q96dk0 homo 


sapien 


31 


269 


21. 


.7 


499 


4 


Q8N5K4 


Q8n5k4 homo 


sapien 


32 


267 


21. 


.6 


416 


4 


Q9NPP6 


Q9npp6 homo 


sapien 


33 


267 


21. 


.6 


497 


4 


Q8WY24 


Q8wy24 homo 


sapien 


34 


267 


21, 


,6 


500 


4 


Q9BRV0 


Q9brv0 homo 


sapien 


35 


261 


21. 


. 1 


486 


11 


Q91Z07 


Q91z07 mus 


musculu 


36 


261 


21. 


.1 


487 


11 


Q99KA4 


Q99ka4 mus 


musculu 


37 


260 


21. 


. 0 


426 


11 


Q9DCD9 


Q9dcd9 mus 


musculu 


38 


253.5 


20. 


.5 


481 


11 


Q91WT3 


Q91wt3 mus 


musculu 


39 


253.5 


20. 


,5 


481 


11 


Q91WT1 


Q91wtl mus 


musculu 


40 


253 .5 


20. 


.5 


482 


11 


Q91X92 


Q91x92 mus 


musculu 


41 


253 .5 


20. 


.5 


482 


11 


Q8K172 


Q8kl72 mus 


musculu 


42 


253 .5 


20. 


.5 


484 


11 


Q8VEA0 


Q8vea0 mus 


musculu 


43 


253 .5 


20. 


5 


488 


11 


Q91WR1 


Q91wrl mus 


musculu 


44 


253 .5 


20. 


,5 


488 


11 


Q8K0F2 


Q8k0f2 mus 


musculu 


45 


253.5 


20. 


,5 


489 


11 


Q8VCX4 


Q8vcx4 mus 


musculu 



ALIGNMENTS 



RESULT 1 
Q8TC77 

ID Q8TC77 PRELIMINARY 
AC Q8TC77; 

DT 01-JUN-2002 (TrEMBLrel . 
DT 01-JUN-2002 (TrEMBLrel. 
DT 01-MAR-2003 (TrEMBLrel. 



PRT; 471 AA. 
21, Created) 

21, Last sequence update) 
23, Last annotation update) 



DE Hypothetical protein. 

OS Homo sapiens (Human) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; 

OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

OX NCBI_TaxID=9606; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC TISSUE=Spleen; 

RA Strausberg R. ; 

RL Submitted (FEB-2002) to the EMBL/ GenBank/ DDB J databases. 

DR EMBL; BC024289; AAH24289.1; -. 

DR InterPro; IPR007110; Ig-like. 

DR InterPro; IPR003 006; Ig_MHC. 

DR InterPro; IPR003596; Ig_v. 

DR Pfam; PF00047; ig; 4. 

DR SMART; SM00406; IGv; 1. 

DR PROSITE; PS50835; IG_LIKE; 4. 

DR PROSITE; PS00290; IG_MHC; 2. 

KW Hypothetical protein. 

SQ SEQUENCE 471 AA; 51791 MW; 388F7F4CF58 8660E CRC64 ; 

Query Match 99.6%; Score 1233; DB 4; Length 471; 

Best Local Similarity 100.0%; Pred. No. 3.9e-108; 

Matches 227; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

DKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCWVDVSHEDPEVKFNWYVD 61 

1 ,1.1 1 MM I Mill I IIIIIIIII I III II IIIIIIIII I Mill II III Mill II 

DKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMI SRTPEVTCVWDVSHEDPEVKFNWYVD 3 04 
GVEVHNAKTKPREEQYNSTYRWSVLTVLHQDWLNGKEYKCKVSNKALPAPI EKTI SKAK 121 

II III II Mill 1 1 MM 1 1 Mill I Mill IIIIIIIII M Ml M I MMI II I MM 

GVEVHNAKTKPREEQYNSTYRWSVLTVLHQDWLNGKEYKCKVSNKALPAPI EKTI SKAK 364 

GQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDS 181 

M II I I I II M M II II II II II I M II I II I II II I II I I I I M M II II II I I M I II 
GQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDS 424 

DGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 228 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 L 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 



Qy 


2 


Db 


245 


Qy 


62 


Db 


305 


Qy 


122 


Db 


365 


Qy 


182 


Db 


425 



RESULT 2 
Q96PQ8 

ID Q96PQ8 PRELIMINARY; PRT; 701 AA. 

AC Q96PQ8; 

DT 01-DEC-2001 (TrEMBLrel . 19, Created) 

DT 01-DEC-2001 (TrEMBLrel. 19, Last sequence update) 

DT 01-MAR-2003 (TrEMBLrel. 23, Last annotation update) 

DE Factor VII active site mutant immunoconjugate. 

OS Homo sapiens (Human) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

OX NCBI__TaxID=9606; 

RN [1] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=21477448; PubMed=11593034 ; 



RA 

RT 

RT 

RL 

CC 

DR 

DR 

DR 

DR 

DR 

DR 

DR 

DR 

DR 

DR 

DR 

DR 

DR 

DR 

DR 

DR 

DR 

DR 

DR 

DR 

DR 

DR 

DR 

DR 

DR 

DR 

DR 

DR 

DR 

DR 

DR 

DR 

DR 

DR 

DR 

KW 

SQ 



Hu Z., Garen A. ; 

"Targeting tissue factor on tumor vascular endothelial cells and tumor 

cells for immunotherapy in mouse models of prostatic cancer."; 

Proc. Natl. Acad. Sci. U.S.A. 98:12180-12185(2001). 

-!- SIMILARITY: BELONGS TO PEPTIDASE FAMILY SI. 

EMBL; AF272774; AAK58686.1; -. 

HSSP; P00761; 1AN1 . 

Asx_hydroxyl . 
Chymo trypsin. 
EGF_2 . 
EGF_Ca . 
EGF_I I . 
EGF_like. 
GLAjDlood. 
Ig-like . 
Ig_cl. 
Ig_MHC. 

Ser_protease_Try . 
VitK dep GLA. 



IPR000152, 
IPR001314; 
IPR000742; 
IPR001881; 
IPR001438; 
IPR006209; 
IPR002383; 
IPR007110; 
IPR003597; 
IPR003006; 
IPR001254; 
IPR000294; 
Pfam; PF00008; EGF; 2. 
Pfam; PF00594; gla; 1. 
Pfam; PF00047; ig; 2. 
Pfam; PF0008 9; trypsin; 1. 
PRINTS ; PR00722; CHYMOTRYPSIN . 
PRINTS; PR00010; EGFBLOOD . 
PRINTS; PRO 0001; GLABLOOD . 
SMART; SM00179; EGF_CA; 1. 
SMART; SM00069; GLA; 1. 
SMART; SM004 07; IGcl; 1- 
SMART; SM00020; Tryp_SPc; 1. 



InterPro 
InterPro 
InterPro 
InterPro 
InterPro 
InterPro 
InterPro 
InterPro 
InterPro 
InterPro 
InterPro 
InterPro 



PROSITE 
PROSITE 
PROSITE 
PROSITE 
PROSITE 
PROSITE 
PROSITE 
PROSITE 
PROSITE 
PROSITE 



PS00010 
PS00022 
PS01186 
PS01187 
PS00011 
PS50835 
PS00290 
PS50240 
PS00134 
PS00135 



1. 



ASX_HYDROXYL; 
EGF_1; 1. 
EGF_2; 1. 
EGF_CA; 1. 
GLU_CARBOX YLAT I ON ; 
IG_LIKE; 2. 
IG_MHC; 1. 
TRY P S I N_DOM ; 
TRYPSIN_HIS; 
TRYPSIN_SER; 

EGF-like domain; Hydrolase; Protease; Serine protease. 
SEQUENCE 701 AA; 77826 MW; 94AC6CEB42CC992F CRC64; 



1. 
1. 
1. 



Query Match 99.6%; Score 1233; DB 4; Length 701; 

Best Local Similarity 100.0%; Pred. No. 6.4e-108; 

Matches 227; Conservative 0; Mismatches 0; Indels 0; 



Gaps 



0; 



Qy 



Db 



Qy 



Db 



2 DKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVWDVSHEDPEVKFNWYVD 61 

IMIIIIIIIIIIIIIIIIIIIIIIMIMMMIIMIIIIIIIIIIIIMIIMIIII 

475 DKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVWDVSHEDPEVKFNWYVD 534 
62 GVEVHNAKTKPREEQYNSTYRWSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKAK 121 

1 1 L 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

535 GVEVHNAKTKPREEQYNSTYRWSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKAK 594 



Qy 



122 GQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDS 181 

IIIIMMIIMIIIIIIIMIIIIMIIIIIIIIMIIIMIIIIMIMIIIIIMII 



595 GQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDS 654 
182 DGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 228 

IIIIIIIIIIIIIMIMIIIIIIIMIIIMIIIIIIMIIIIIM 

655 DGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 701 



RESULT 3 
Q8N4Y9 

ID Q8N4Y9 PRELIMINARY; PRT; 521 AA. 

AC Q8N4Y9 ; 

DT 01-OCT-2002 (TrEMBLrel . 22 , Created) 

DT 01-OCT-2002 (TrEMBLrel. 22, Last sequence update) 

DT 01-MAR-2003 (TrEMBLrel. 23, Last annotation update) 

DE Hypothetical protein. 

OS Homo sapiens (Human) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ,- 

OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

OX NCBI_TaxID=9606; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC TISSUE=Primary B-Cells from Tonsils; 

RA Strausberg R. ; 

RL Submitted (JUN-2002) to the EMBL/ GenBank/ DDB J databases. 

DR EMBL; BC033178; AAH33178.1; 

DR InterPro; IPR007110; Ig-like. 

DR InterPro; IPR003597; Ig_cl. 

DR InterPro; IPR003006; lg_MHC. 

DR InterPro; IPR003596; Ig_v. 

DR Pfam; PF00047; ig; 4. 

DR SMART; SM004 07; IGcl; 3. 

DR SMART; SM00406; IGv; 1. 

DR PROSITE; PS50835; IG_LIKE; 4. 

DR PROSITE; PS00290; IG_MHC; 2. 

KW Hypothetical protein. 

SQ SEQUENCE 521 AA; 57156 MW; 2AC7D22E72D6CAA2 CRC64 ; 



Query Match 92.2%; Score 1142; DB 4; Length 521; 

Best Local Similarity 92.1%; Pred. No. 1.8e-99; 

Matches 209; Conservative 9; Mismatches 9; Indels 0; Gaps 



Qy 


2 


DKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEWCvWDVSHEDPEVKFNWYVD 

I M MMMMMMMMMMMMMMMIMMMMMMIhl MM 

DTPPPCPRCPAPELLGGPSVFLFPPKPKDTlJyiISRTPEVTCVVVDVSHEDPEVQFKWYVD 


61 


Db 


295 


354 


Qy 


62 


GVEVHNAKTKPREEQYNSTYRWSVLTVLHQDWLNGKEYKCKVSNKALPAPI EKTI SKAK 

1 1 1 II II 1 II II M h M 1 : 1 1 1 1 II 1 1 1 1 1 II 1 1 1 II 1 1 II M 1 II 1 1 II 1 1 1 i II 1 1 

GVEVHNAKTKPREEQFNSTFRWSVLTVLHQDWLNGKEYKCKVSNKALPAP I EKTI SKTK 


121 


Db 


355 


414 


Qy 


122 


GQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDS 

M II 1 1 1 1 1 1 1 1 1 M : h 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 II 1 1 II 1 h II 1 1 1 1 i Mlhlll 

GQPREPQVYTLPPSREEMTKNQVSLTCLVKGFYPSDIAVEWESSGQPENNYNTTPPMLDS 


181 


Db 


415 


474 


Qy 


182 


DGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 228 

1 1 1 1 1 1 II 1 1 1 1 II 1 1 1 1 1 1 1 ^ 1 1 1 1 1 1 1 1 1 1 M MIMMMIII 

DGSFFLYSKLTVDKSRWQQGNIFSCSVMHEALHNRFTQKSLSLSPGK 52 1 




Db 


475 





RESULT 4 
Q8TC63 

ID Q8TC63 PRELIMINARY; PRT; 473 AA. 

AC Q8TC63; 

DT 01-JUN-2002 (TrEMBLrel . 21, Created) 

DT 01-JUN-2002 (TrEMBLrel. 21, Last sequence update) 

DT 01-MAR-2 003 (TrEMBLrel. 23, Last annotation update) 

DE Hypothetical protein. 

OS Homo sapiens (Human) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

OX NCBI_TaxID=9606; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC TISSUE=Kidney; 

RA Strausberg R.; 

RL Submitted (MAR-2002) to the EMBL/ GenBank/ DDB J databases. 

DR EMBL; BC025985; AAH25985.1; 

DR InterPro; IPR000923; BlueCu_l . 

DR InterPro; IPR007110; Ig-like. 

DR InterPro; IPR003006; Ig_MHC. 

DR InterPro; IPR003596; Ig__v. 

DR Pfam; PF00047; ig; 4. 

DR SMART; SM00406; IGv; 1. . 

DR PROSITE; PS00196; COPPER_BLUE; 1. 

DR PROSITE; PS50835; IG_LIKE; 4. 

DR PROSITE; PS00290; IG_MHC; 3. 

KW Hypothetical protein. 

SQ SEQUENCE 473 AA; 51986 MW; E29920B09BA369F5 CRC64 ; 

Query Match 91.7%; Score 1135; DB 4; Length 473; 

Best Local Similarity 93.7%; Pred. No. 7.1e-99; 

Matches 208; Conservative 8; Mismatches 6; Indels 0; Gaps 0; 

CPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFNWYVDGVEVH 66 

II Mill 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 k 1 L 1 1 1 1 IMIhllMIIMMI 

C P S C PAP E FLGG P S VFLF P P KP KDTLM I S RT P E VTC VWD VS QED P E VQFNW Y VDG VE VH 311 
NAKTKPREEQYNSTYRWS VLTVLHQDWLNGKEYKCKVSNKALPAP I EKTI SKAKGQPRE 126 

lllllllllhllllllllllllllllllllllllllllM i IMIIIIIIIIMI 

NAKTKPREEQFNSTYRWSVLTVLHQDWLNGKEYKCKVSNKGLPSS I EKTI SKAKGQPRE 371 

PQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFF 186 

I I I I I I I I I I : I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

PQVYTLPPSQEEMTKNQVSLTCLVKGFYPSDI AVEWESNGQPENNYKTTPPVLDSDGSFF 431 

LYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 228 

M M 1 1 1 1 1 1 1 HI 1 1 1 . 1 1 1 1 1 1 1 i 1 1 II 1 1 1 1 1 , II 



Qy 


7 


Db 


252 


Qy 


67 


Db 


312 


Qy 


127 


Db 


372 


Qy 


187 


Db 


432 



RESULT 5 
Q8NF17 

ID Q8NF17 PRELIMINARY; PRT; 509 AA. 

AC Q8NF17; 

DT 01-OCT-2002 (TrEMBLrel. 22, Created) 



DT 01-OCT-2002 (TrEMBLrel . 22, Last sequence update) 

DT Ol-MAR-2003 (TrEMBLrel . 23, Last annotation update) 

DE FLJ00385 protein (Fragment) . 

GN FLJ0038 5. 

OS Homo sapiens (Human) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

OX NCBI_TaxID=9606; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC TISSUE=Spleen; 

RA Jikuya H. , Takano J., Kikuno R. , Nagase T., Ohara O. ; 

RT "The nucleotide sequence of a long cDNA clone isolated from human 

RT spleen."; 

RL Submitted (JUL-2002) to the EMBL/GenBank/DDB J databases. 

DR EMBL; AK090464; BAC03445.1; 

DR InterPro; IPR007110; Ig-like. 

DR InterPro; IPR003597; Ig_cl. 

DR InterPro; IPR003006; IgJVlHC. 

DR Pfam; PF00047; ig; 3. 

DR SMART; SM004 07; IGcl; 3. 

DR PROSITE; PS50835; IG_LIKE; 3. 

DR PROSITE; PS00290; IG_MHC; 2. 

FT NON_TER 1 1 

SQ SEQUENCE 509 AA; 56111 MW; 089498D8076E863C CRC64; 

Query Match 91.4%; Score 1131; DB 4; Length 509; 

Best Local Similarity 92.0%; Pred. No. 1.9e-98; 

Matches 207; Conservative 9; Mismatches 9; Indels 0; Gaps 0; 
Qy 2 DKTHTCPPCPAPELl^GPSVFLFPPKPKDTLMISRTPEVTCWVDVSHEDPEVKFNWYVD 61 

I II IIIIIIIMIIIIIIIIIIIIIIIIMIIIIMIIIIIIIIIMhl 1 1 1 1 

Db 214 DTPPPCPRCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTOAA/DVSHEDPEVQFKWYVD 273 

Qy 62 GVEVHNAKTKPREEQYNSTYRWSVLTVLHQDWLNGKEYKCKVSNKALPAPI EKTI SKAK 121 

MIIIIIIIMIMh' MMIMMIMIIMI MMIMMMIMMIII I I 

Db 274 GVEVHNAKTKPREEQFNSTFRWSVljTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKTK 333 

Qy 122 GQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDS 181 

MM ! II I ! II ! II!, I M I ' lllhlll 

Db 334 GQPREPQVYTLPPSREEMTKNQVSLTCLVKGFYPSDIAVEWESSGQPENNYNTTPPMLDS 3 93 

Qy 182 DGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSP 226 

I I I I I I I I I I I I I I . I I I h M I I I I I I I I I I :||||||||| 
Db 394 DGSFFLYSKLTVDKSRWQQGNIFSCSVMHEALHNRFTQKSLSLSP 438 

RESULT 6 
Q95M34 

ID Q95M34 PRELIMINARY; PRT; 337 AA. 

AC Q95M34; 

DT 01-DEC-2001 (TrEMBLrel. 19, Created) 

DT 01-DEC-2001 (TrEMBLrel. 19, Last sequence update) 

DT Ol-MAR-2003 (TrEMBLrel. 23, Last annotation update) 

DE Immunogobulin gamma 1 heavy chain constant region (Fragment) . 

GN IGHC1 . 

OS Equus caballus (Horse) . 



Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; 
Mammalia; Eutheria; Perissodactyla; Equidae; Equus . 
NCBI_TaxID=9796; 
[1] 

SEQUENCE FROM N.A. 
Wagner B . ; 

Submitted (DEC-2000) to the EMBL/GenBank/DDBJ databases. 
[2] 

SEQUENCE FROM N.A. 

MEDLINE=98383416; PubMed-9717671 ; 

Wagner B . , Overesch G., Sheoran A., Holmes M., Richards C. , 
Leibold W., Radbruch A.; 

"Organization of the equine immunoglobulin heavy chain constant region 
genes. III. Alignment of c-mu, c-gamma, c-epsilon and c-alpha genes."; 
Immunobiology 199:105-119(1998). 
EMBL; AJ300675; CAC44624.1; 
InterPro; IPR007110; Ig-like. 
InterPro; IPR003597; Ig_cl. 
InterPro; IPR003006; Ig_MHC. 
Pfam; PF00047; ig; 2. 
SMART; SM004 07; IGcl; 2. 
PROSITE; PS50835; IG_LIKE; 3. 
PROSITE; PS00290; IG_MHC; 2. 
NON_TER 1 1 

SEQUENCE 337 AA; 37438 MW; A60BF2B01DEFD1F6 CRC64 ; 

Query Match 73.4%; Score 909; DB 6; Length 337; 

Best Local Similarity 71.9%; Pred. No. l.le-77; 

Matches 161; Conservative 32; Mismatches, 29; Indels 2; Gaps 



Qy 


7 


CPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCvWDVSHEDPEVKFNWYVDGVEVH 

II llllllllllllhlll llllllhlllllllllllll hhlllllhlllll 

C P KC PA P ELLGG P S VF I F P PN P KDTLM I TRT P E VTC VWD VS QEN PD VKFNW YMDG VE VR 


66 


Db 


114 


173 


Qy 


67 


NAKTKPREEQYNSTYRWSVLTVLHQDWLNGKEYKCKVSNKALPAPI EKTI SKAKGQPRE 

I hhllhllllllllll : II II h M h 1 II h h 1 1 1 llhlhl Ih : 1 

TATTRPKEEQFNSTYRWSVLRIQHQDWLSGKEFKCKVNNQALPQPIERTITKTKGRSQE 


126 


Db 


174 


233 


Qy 


127 


PQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQP - -ENNYKTTPPVLDSDGS 

MM 1 1 II h h M h II 1 II III M M h M 1 II 1 III Mill 

PQVYVLAPHPDELSKSKVSVTCLVKDFYPPEINIEWQSNGQPELETKYSTTQAQQDSDGS 


184 


Db 


234 


293 


Qy 


185 


FFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 228 

: II 1 II h 1 h • II M 1 hi MIMIIIIIIIhM MM 

YFLYS KLS VDRNRWQQGTTFTCGVMHEALHNHYTQKN VS KNPGK 337 




Db 


294 





OC 
OC 
OX 
RN 
RP 
RA 
RL 
RN 
RP 
RX 
RA 
RA 
RT 
RT 
RL 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
FT 
SQ 



RESULT 7 
Q8R3V9 

ID Q8R3V9 PRELIMINARY; PRT; 4 69 AA. 

AC Q8R3V9; 

DT 01-JUN-2002 (TrEMBLrel . 21, Created) 

DT 01-JUN-2002 (TrEMBLrel. 21, Last sequence update) 

DT 01-MAR-2003 (TrEMBLrel . 23, Last annotation update) 

DE Hypothetical 52.0 kDa protein. 

OS Mus musculus (Mouse) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; 



OC Mammalia; Eutheria; Rodentia; Sciurognathi; Muridae; Murinae; Mus . 

OX NCB I _Tax I D= 1 0 0 9 0 ; 

RN [1] 

RP SEQUENCE FROM N.A. 

RA Strausberg R.; 

RL Submitted (MAR-2002) to the EMBL/ GenBank/ DDE J databases. 

DR EMBL; BC024405; AAH24405.1; -. 

DR InterPro; IPR007110; Ig-like. 

DR InterPro; IPR003006; Ig_MHC. 

DR InterPro; IPR003596; Ig_v. 

DR Pfam; PF00047; ig; 4. 

DR SMART; SM00406; IGv; 1. 

DR PROSITE; PS50835; IG_LIKE; 4. 

DR PROSITE; PS00290; IGJVIHC; 1. 

KW Hypothetical protein. 

SQ SEQUENCE 469 AA; 51976 MW; 534793F155D05457 CRC64 ; 

Query Match 67.2%; Score 831.5; DB 11; Length 469; 

Best Local Similarity 63.8%; Pred. No. 3.5e-70; 

Matches 143; Conservative 41; Mismatches 35; Indels 5; Gaps 2; 
Qy 7 CPP--CPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCWVDVSHEDPEVKFNWYVDGVE 64 

M I lh 1 1 1 : 1 1 1 1 1 1 1 I h I h 1 1 1 1 1 1 1 : 1 H I I h I • h I I II 

Db 249 CKPCICTVPEV SSVFIFPPKPKDVLTITLTPKVTCVVVDISKDDPEVQFSWFVDDVE 305 

Qy 65 WNAKTKPREEQYNSTYRWSVLTVLHQDWLNGKEYKCKVSNKALPAPI EKTI SKAKGQP 124 

|| hllllllhllhl II I ::|IIIIIMhlhh: I llllllllll 
Db 306 VHTAQTKPREEQFNSTFRSVSELPIMHQDWLNGKEFKCRVNSAAFPAPIEKTISKTKGRP 365 

Qy 125 REPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGS 184 

: I'l l :::: I.I. hi II llh I I II III I I : : I = I I I 

Db 3 66 KAPQVYTIPPPKEQMAKDKVSLTCMITDFFPEDITVEWQWNGQPAENYKNTQPIMDTDGS 425 

Qy 185 FFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 228 

I | M h II hllhll lllhhilll I I I I 

Db 426 YFVYSKLNVQKSNWEAGNTFTCSVLHEGLHNHHTEKSLSHSPGK 469 



RESULT 8 
Q99LC4 

ID Q99LC4 PRELIMINARY; PRT; 463 AA. 

AC Q99LC4; 

DT 01-JUN-2001 (TrEMBLrel. 17, Created) 

DT 01-JUN-2001 (TrEMBLrel. 17, Last sequence update) 

DT 01-MAR-2003 (TrEMBLrel. 23, Last annotation update) 

DE Similar to RIKEN cDNA 1810060009 gene. 

GN IGH-4. 

OS Mus musculus (Mouse) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Rodentia; Sciurognathi; Muridae; Murinae; Mus. 

OX NCBI_TaxID=10090; 

RN [1] 

RP SEQUENCE FROM N.A. 

RA Strausberg R.; 

RL Submitted (FEB-2001) to the EMBL/ GenBank/ DDB J databases. 

DR EMBL; BC003435; AAH03435.1; -. 

DR HSSP; P01842; 7 FAB . 



DR MGD; MGI: 96446; Igh-4 . 

DR InterPro; IPR007110; Ig-like. 

DR InterPro; IPR003006; Ig_MHC. 

DR InterPro; IPR003596; Ig_v. 

DR Pfam; PF00047; ig ; 4. 

DR SMART; SM004 06; IGv; 1. 

DR PROSITE; PS50835; IG_LIKE; 4. 

DR PROSITE; PS 00290; IG_MHC; 1. 

SQ SEQUENCE 463 AA; 51007 MW; EAA674C6BBC3 0783 CRC64; 

Query Match 66.9%; Score 828.5; DB 11; Length 463; 

Best Local Similarity 63.4%; Pred. No. 6.6e-70; 

Matches 142; Conservative 42; Mismatches 35; Indels 5; Gaps 2; 

Qy 7 CPP - -CPAPELLGGPSVFLFPPKPKDTLMI SRTPEVTCVVVDVSHEDPEVKFNWYVDGVE 64 

I I i 11= llhllMIII I I Ihllllllhl :|||hhh|| M 

Db 243 CKPCICTVPEV SSVFIFPPKPKDVIjTITLTPKVTCVVVDISKDDPEVQFSWFVDDVE 299 

Qy 65 VHNAKTKPREEQYNSTYRWSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKAKGQP 124 

II hhllllhllhl II I ::|IMIIIIhlhh: I llllllllll Ihl 

Db 300 VHTAQTQPREEQFNSTFRSVSELPIMHQDWLNGKEFKCRVNSAAFPAPIEKTISKTKGRP 359 

Qy 125 REPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGS 184 

= Mllhll :::: I L ^ hi II llh I I I I III I l-hlll 

Db 360 KAPQWTIPPPKEQMAKDKVSLTCMI TDFFPEDI TVEWQWNGQPAENYKNTQPI MDTDGS 419 

Qy 185 F FL YS KLT VDKS RWQQGN VF S CS VMH EALHNH YTQ KS LS LS PGK 228 

! I II h II hllhll lllhhilll 1 1 1 1 

Db 420 YFI YSKLNVQKSNWEAGNTFTCSVLHEGLHNHHTEKSLSHSPGK 463 



RESULT 9 
Q9R1A4 

ID Q9R1A4 PRELIMINARY; PRT; 437 AA. 

AC Q9R1A4; 

DT 01-MAY-2000 (TrEMBLrel . 13, Created) 

DT 01-MAY-2000 (TrEMBLrel. 13, Last sequence update) 

DT 01-MAR-2003 (TrEMBLrel. 23, Last annotation update) 

DE Gammal heavy chain of Mab7 (Fragment). 

GN IGH-4 . 

OS Mus musculus (Mouse) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; Mus. 

OX NCBI_TaxID=10090; 

RN [1] 

RP SEQUENCE FROM N.A. 

RA Wilde K.G., Yu X., Ekramoddoullah A.K.M., Misra S . ,- 

RT "Cloning of cDNAs encoding for anti-white pine blister rust monoclonal 

RT antibody (Mab 7, its light and heavy chains) and construction of a 

RT single chain antibody (scFV)."; 

RL Submitted (MAY-1999) to the EMBL/ GenBank / DDB J databases. 

DR EMBL ; AF152372; AAD40243.1; 

DR HSSP; P01842; 7 FAB . 

DR MGD; MGI : 96446 ; Igh-4. 

DR InterPro; IPR007110; Ig-like. 

DR InterPro; IPR003006; Ig_MHC. 

DR InterPro; IPR003596; Ig_v. 



DR Pfam; PF00047; ig ; 4. 

DR SMART; SM00406; IGv; 1. 

DR PROSITE; PS50835; IG_LIKE; 4. 

DR PROSITE; PS00290; IG_MHC; 1. 

FT NON__TER 1 1 

FT NON_TER 437 437 

SQ SEQUENCE 437 AA; 48142 MW; 5C3A7BB3EE7D697C CRC64 ; 



Query Match 66.6%; Score 824.5; DB 11; Length 437; 

Best Local Similarity 62.9%; Pred. No. 1.5e-69; 

Matches 141; Conservative 43; Mismatches 35; Indels 5; Gaps 



2; 



Qy 

Db 

Qy 
Db 



7 CPP - - CPAPELLGGPSVFLFPPKPKDTLMI SRTPEWCWVDVSHEDPEVKFNWYVDGVE 64 

Mill: llhlllllll I h Ihllllllhl : I I I I : I : I : | I II 

217 CKPCICTVPEV SSVFIFPPKPKDVLTITLTPKVTCWVDISKDDPEVQFSWFVDDVE 273 

65 VHNAKTKPREEQYNSTYRWSVLTVLHQDWLNGKEYKCKVSNKALPAPI EKTI SKAKGQP 124 

II hhllllhllhl || | ::|||||||||:|h|:: I MINIM Ihl 
274 VHTAQTQPREEQFNSTFRSVSELPIMHQDWLNGKEFKCRVNSAAFPAPIEKTISKTKGRP 333 



Qy 



Db 



125 REPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGS 184 

h -h ■ II hi II llh I I 1 1 III h I II 

334 KAPQVYTIPPPKEQMAKDKVSLTCMITDFFPEDITVEWQWNGQPAENYKNTQPIMDTDGS 393 



Qy 



Db 



185 FFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 228 

■ h I II h II hllhll lllhhhil II I I 

3 94 YFVYSKLNVQKSNWEAGNTFTCSVLHEGLHNHHTEKNLSHSPGK 437 



RESULT 10 
Q9D8L4 

ID Q9D8L4 PRELIMINARY; PRT; 473 AA. 

AC Q9D8L4; 

DT 01-JUN-2001 (TrEMBLrel. 17, Created) 

DT 01-JUN-2001 (TrEMBLrel. 17, Last sequence update) 

DT 01-MAR-2003 (TrEMBLrel. 23, Last annotation update) 

DE 1810060O09Rik protein. 

GN IGH-1 OR 1810060O09RIK. 

OS Mus musculus (Mouse) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; Mus. 

OX NCBI_TaxID=10090; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=C57BL/6J; TISSUE=Pancreas ; 

RX MEDLINE=21085660; PubMed=11217851 ; 

RA Kawai J., Shinagawa A., Shibata K. , Yoshino M. , Itoh M. , Ishii Y., 

RA Arakawa T. , Hara A., Fukunishi Y. , Konno H., Adachi J., Fukuda S., 

RA Aizawa K. , Izawa M. , Nishi K. , Kiyosawa H. , Kondo S., Yamanaka I., 

RA Saito T., Okazaki Y. , Gojobori T. , Bono H. , Kasukawa T. , Saito R., 

RA Kadota K. , Matsuda H.A., Ashburner M., Batalov S., Casavant T. # 

RA Fleischmann W., Gaasterland T. , Gissi C. , King B., Kochiwa H., 

RA Kuehl P., Lewis S., Matsuo Y., Nikaido I., Pesole G. , Quackenbush J., 

RA Schriml L.M., Staubli F., Suzuki R. , Tomita M. , Wagner L. , Washio T. , 

RA Sakai K. , Okido T. , Furuno M. , Aono H. , Baldarelli R. , Barsh G., 

RA Blake J., Boffelli D. , Bojunga N. , Carninci P., de Bonaldo M.F., 

RA Brownstein M. J. , Bult C. , Fletcher C. , Fujita M., Gariboldi M . , 



RA Gustincich S., Hill D. , Hofmann M., Hume D.A. , Kamiya M . , Lee N.H., 

RA Lyons P., Marchionni L. , Mashima J., Mazzarelli J., Mombaerts P., 

RA Nordone P., Ring B. , Ringwald M . , Rodriguez I., Sakamoto N . , 

RA Sasaki H., Sato K. , Schoenbach C. , Seya T. , Shibata Y., Storch K.-F., 

RA Suzuki H., Toyo-oka K. , Wang K.H. , Weitz C, Whittaker C. , Wilming L. , 

RA Wynshaw-Boris A., Yoshida K. , Hasegawa Y. , Kawa j i H., Kohtsuki S., 

RA Hayashizaki Y.; 

RT "Functional annotation of a full-length mouse cDNA collection."; 

RL Nature 409:685-690(2001). 

DR EMBL; AK007918; BAB25349.1; -. 

DR HSSP; P01842; 7FAB . 

DR MGD; MGI: 96443; Igh-1. 

DR InterPro; IPR007110; Ig-like. 

DR InterPro; IPR003006; Ig_MHC. 

DR InterPro; IPR003596; Ig_v. 

DR Pfam; PF00047; ig ; 4. 

DR SMART; SM00406; IGv; 1. 

DR PROSITE; PS50835; IG_LIKE; 4. 

DR PROSITE; PS00290; IG_MHC; 1. 

SQ SEQUENCE 473 AA; 51699 MW; 9DED57A514475FBB CRC64; 

Query Match 65.5%; Score 811; DB 11; Length 473; 

Best Local Similarity 64.0%; Pred. No. 3.1e-68; 

Matches 142; Conservative 38; Mismatches 42; Indels 0; Gaps 0; 

Qy 7 CPPCPAPELLGGPSVFLFPPKPIODTLMISRTPEVTCVVVDVSHEDPEVKFNWYVDGVEVH 66 

I I I I Ihlllllllhllll II I I I I H lllllllll :|hh ! MM 

Db 252 CPPCAAPDLLGGPSVFIFPPKIKDVLMISLSPMVTCWVDVSEDDPDVQISWFVNNVEVH 311 

Qy 67 NAKTKPREEQYNSTYRWSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKAKGQPRE 126 

I I I II I I I I I I : lllhHIhIllhhIlhlMIIIM = I I 

Db 312 TAQTQTHREDYNSTLRWSALPIQHQDWMSGKEFKCKVNNRALPSPIEKTISKPRGPVRA 371 

QY 127 PQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFF 186 

I I I I Ml I : lllh: \\ |::|||:| I | ||| | | | || I I I = I 

Db 372 PQVYVLPPPAEEMTKKEFSLTCMITGFLPAEIAVDWTSNGRTEQNYKNTATVLDSDGSYF 431 

Qy 187 LYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 228 

Mill I II h : h : I : I I h I I MM I MM I II 
Db 432 MYSKLRVQKSTWERGSLFACSWHEGLHNHLTTKTI SRSLGK 473 



RESULT 11 
Q99L31 

ID Q99L31 PRELIMINARY; PRT; 468 AA. 

AC Q99L31; 

DT 01-JUN-2001 (TrEMBLrel. 17, Created) 

DT 01-JUN-2001 (TrEMBLrel. 17, Last sequence update) 

DT 01-MAR-2003 (TrEMBLrel. 23, Last annotation update) 

DE Similar to RIKEN cDNA 1810060009 gene. 

OS Mus musculus (Mouse) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae,- Murinae; Mus. 

OX NCBI_TaxID=10090; 

RN [1] 

RP SEQUENCE FROM N.A. 

RA Strausberg R. ; 



RL Submitted (FEB-2001) to the EMBL/GenBank/DDBJ databases. 

DR EMBL; BC003878; AAH03878.1; 

DR HSSP; P01842; 7FAB . 

DR InterPro; IPR007110; Ig-like. 

DR InterPro; IPR003006; Ig_MHC. 

DR InterPro; IPR003596; Ig_v. 

DR Pfam; PF00047; ig ; 3. 

DR SMART; SM00406; IGv; 1. 

DR PROSITE; PS50835; IG_LIKE; 4. 

DR PROSITE; PS00290; IG_MHC; 1. 

SQ SEQUENCE 468 AA; 51661 MW; 963 52328B3332ADB CRC64 ; 



Query Match 65.0%; Score 805; DB 11; Length 468; 

Best Local Similarity 65.2%; Pred. No. l.le-67; 

Matches 146; Conservative 30; Mismatches 46; Indels 2; Gaps 1; 
Qy 7 CPP--CPAPELLGGPSVFLFPPKPKDTLMISRTPEWC\AA^VSHEDPEVKFNWYVT)GVE 64 

III 1 1 1 1 1 1 ' I II I II I M MINIMI Mhh I II 

Db 245 CPPCKCPAPNLLGGPSVFIFPPKIKDVLMISLSPMVTCVWDVSEDDPDVQISWFVNNVE 304 

Qy 65 VHNAKTKPREEQYNSTYRWSVLTVLHQDWLNGKEYKCKVSNKALPAPI EKTI SKAKGQP 124 

I I I MM MM I : lllhMlhlllhllllllllhllll 

Db 305 VLTAQTQTHREDYNSTLRWSALPI QHQDWMSGKEFKCKWNKALPAPI ERTI SKPKGSV 364 

Qy 125 REPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGS 184 

llllllll IM | | || Ml :||: | Ml | IIIIIIM 

Db 3 65 RAPQVTVljPPPEEEMTKKQVTLTCMvTDFMPEDIYVEWTNNGKTELNYKNTEPVLDSDGS 424 

Qy 18 5 FFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 228 

MMIM hi I : I : I I I I : I I lllhl II I MM 
Db 425 YFMYSKLRVEKKNWVERNSYSCSWHEGLHNHHTTKSFSRTPGK 468 



RESULT 12 
Q99L25 



ID Q99L25 PRELIMINARY; PRT; 473 AA. 

AC Q99L25; 

DT 01-JUN-2001 (TrEMBLrel. 17, Created) 

DT 01-JUN-2001 (TrEMBLrel. 17, Last sequence update) 

DT 01-MAR-2003 (TrEMBLrel. 23, Last annotation update) 

DE Similar to RIKEN cDNA 1810060009 gene. 

OS Mus mus cuius (Mouse) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; Mus. 

OX NCBI__TaxID=10090; 

RN [1] 

RP SEQUENCE FROM N . A. 

RA Strausberg R. ; 

RL Submitted (FEB-2001) to the EMBL/GenBank/DDBJ databases. 

DR EMBL; BC003888; AAH03888.1; -. 

DR HSSP; P01842; 7FAB. 

DR InterPro; IPR007110; Ig-like. 

DR InterPro; IPR003006; Ig_MHC. 

DR InterPro; I PRO 03 596; Ig_v. 

DR Pfam; PF00047; ig ; 3. 

DR SMART; SM00406; IGv; 1. 

DR PROSITE; PS50835; IG_LIKE; 4. 



DR PROSITE; PS00290; IG_MHC; 1. 

SQ SEQUENCE 473 AA; 52449 MW; BE9889B7986DA155 CRC64; 

Query Match 65.0%; Score 805; DB 11; Length 473; 

Best Local Similarity 65.2%; Pred. No. l.le-67; 

Matches 146; Conservative 30; Mismatches 46; Indels 2; Gaps 1; 

Qy 7 CPP--CPAPEL]^GPSVFLFPPKPKDTLMISRTPEvTCVVVDVSHEDPEVKFNWYVDGVE 64 

Ml I I I I lllllllhllll II llll = | IIIMIMI I I- :hh II 
Db 25 0 CPPCKCPAPNLLGGPSVFI FPPKI KDVLMI SLSPMVTCVWDVSEDDPDVQISWFVNNVE 3 09 

Qy 65 VHNAKTKPREEQYNSTYRWSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKAKGQP 124 

I hh I llll llll I : IllhUlhlllhllllllllhllll 

Db 310 VLTAQTQTHREDYNSTLRWSALPI QHQDWMSGKEFKCKVNNKALPAP I ERTI SKPKGSV 369 



Qy 125 REPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEVJESNGQPENNYKTTPPVLDSDGS 184 

lllll lll = 1 = 11 I hi I hi I I II III :||= I Ml I llllllll 
Db 370 RAPQVYVLPPPEEEMTKKQVTLTCMVTDFMPEDIYVEWTNNGKTELNYKNTEPVLDSDGS 429 

Qy 185 FFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 228 

I LI hi I : I MM Ml hi II I Mil 

Db 430 YFMYSKLRVEKKNWERNSYSCSWHEGLHNHHTTKSFSRTPGK 473 



RESULT 13 
Q91Z05 

ID Q91Z05 PRELIMINARY; PRT; 473 AA. 

AC Q91Z05; 

DT 01-DEC-2001 (TrEMBLrel . 19, Created) 

DT 01-DEC-2001 (TrEMBLrel. 19, Last sequence update) 

DT 01-MAR-2003 (TrEMBLrel. 23, Last annotation update) 

DE Hypothetical 51.9 kDa protein. 

GN AU044919. 

OS Mus musculus (Mouse) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; Mus. 

OX NCBI_TaxID=10090 ; 

RN [1] 

RP SEQUENCE FROM N. A. 

RA Strausberg R. ; 

RL Submitted (JUL-2001) to the EMBL/GenBank/DDBJ databases. 

DR EMBL; BC010327; AAH10327.1; -. 

DR MGD; MGI: 2144967; AU044919. 

DR InterPro; IPR000345; CytC_heme_bind . 

DR InterPro; IPR007110; Ig-like. 

DR InterPro; IPR003006; Ig_MHC. 

DR InterPro; IPR003596; Ig_v. 

DR Pfam; PF00047; ig ; 3. 

DR SMART; SM00406; IGv; 1. 

DR PROSITE; PS00190; CYTOCHROME_C ; 1. 

DR PROSITE; PS50835; IG_LIKE; 4. 

DR PROSITE; PS00290; IG_MHC; 1. 

KW Hypothetical protein. 

SQ SEQUENCE 473 AA; 51946 MW; CF625F008 932AF12 CRC64 ; 



Query Match 63.0%; Score 78 0; DB 11; Length 473; 

Best Local Similarity 61.8%; Pred. No. 2.6e-65; 



Matches 141; Conservative 34; Mismatches 47; Indels 6; Gaps 1; 



Qy 7 CPP CPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVWDVSHEDPEVKFNWYV 60 

III 1 1 1 1 I MIIIIMM II 1 1 1 1 Mil III I Mhh MM 

Db 246 CPPCKECHKCPAPNLEGGPSVFIFPPNIKDVLMISLTPKVTCVWDVSEDDPDVQISWFV 305 

Qy 61 DGVEVHNAKTKPREEQYNSTYRWSVLTVLHQDWLNGKEYKCKVSNKALPAPI EKTI SKA 12 0 

■■ MM I I MM MM I : MlhMMMIMMI IIMIIMIII 

Db 306 NNVEVHTAQTQTHREDYNSTIRWSALPIQHQDWMSGKEFKCKVNNKDLPSPIERTISKI 365 

QY 121 KGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLD 18 0 

II I MM III :M;: III M I MMM III I III I MM 

Db 3 66 KGLVRAPQVYI LPPPAEQLSRKDVSLTCLWGFNPGDI SVEWTSNGHTEENYKDTAPVLD 425 

Qy 181 SDGS FFL YS KLTVDKSRWQQGNVFS CS VMHEALHNH YTQKSLSLS PGK 228 

IMhhllll : | = | = : = ||h| M | h| :|: = | |||| 
Db 426 SDGSYFI YSKLDIKTSKWEKTDSFSCNVRHEGLKNYYLKKTISRSPGK 473 



RESULT 14 
Q8R3H6 

ID Q8R3H6 PRELIMINARY; PRT; 474 AA. 

AC Q8R3H6; 

DT 01-JUN-2002 (TrEMBLrel . 21, Created) 

DT 01-JUN-2002 (TrEMBLrel. 21, Last sequence update) 

DT 01-MAR-2003 (TrEMBLrel. 23, Last annotation update) 

DE Hypothetical 51.7 kDa protein. 

OS Mus musculus (Mouse) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; 

OC Mammalia; Eutheria; Rodent ia; Sciurognathi ; Muridae; Murinae; Mus. 

OX NCBI_TaxID=10090; 

RN [1] 

RP SEQUENCE FROM N.A. 

RA Strausberg R. ; 

RL Submitted (MAR-2002) to the EMBL/GenBank / DDB J databases. 

DR EMBL; BC025447; AAH25447.-1; 

DR InterPro; IPR000345; CytC_heme_bind. 

DR InterPro; IPR007110; Ig-like. 

DR InterPro; IPR003006; Ig_MHC. 

DR InterPro; IPR003596; Ig_v. 

DR Pfam; PF00047; ig ; 3. 

DR SMART; SM00406; IGv; 1. 

DR PROSITE; PS00190; CYTOCHROME_C ; 1. 

DR PROSITE; PS50835; IG_LIKE; 4. 

DR PROSITE; PS00290; IG_MHC; 1. 

KW Hypothetical protein. 

SQ SEQUENCE 474 AA; 51748 MW; 8608B57C6CD2874A CRC64; 

Query Match 63.0%; Score 780; DB 11; Length 474; 

Best Local Similarity 61.8%; Pred. No. 2.6e-65; 

Matches 141; Conservative 34; Mismatches 47; Indels 6; Gaps 1; 
Qy V CPP CPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFNWYV 60 

Ml MM I lllllhlll M MM Ihlll MMM Mhh :hl 

Db 247 CPPCKECHKCPAPNLEGGPSVFIFPPNIKDVLMISLTPKVTCVVVDVSEDDPDVQISWFV 3 06 



Qy 61 DGVEVHNAKTKPREEQYNSTYRWSVLTVLHQDWLNGKEYKCKVSNKALPAPI EKTI SKA 12 0 



: I I I I hh I I I I I I I I I I : lllhMlhlllhll Ihllhllll 

Db 307 NNVEVHTAQTQTHREDYNSTIRWSALPIQHQDWMSGKEFKCKVNNKDLPSPIERTISKI 366 

Qy 121 KGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLD 180 

II I I I I I II! ::h: lllllll II I I I I I I Ml I III I I I I I 
Db 367 KGLVRAPQVYI LPPPAEQLSRKDVSLTCLWGFNPGDI SVEWTSNGHTEENYKDTAPVLD 426 

Qy 181 SDGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 228 

IllhhIIII = hi- : Mhl II I hi :| = = | MM 
Db 427 S DGS YF I YS KLD I KTS KWE KTDS FS CNVRHEGLKN Y YLKKT ISRSPGK 474 



RESULT 15 
Q9BSZ1 

ID Q9BSZ1 PRELIMINARY; PRT; 375 AA. 

AC Q9BSZ1; 

DT 01-JUN-2001 (TrEMBLrel . 17, Created) 

DT 01-JUN-2001 (TrEMBLrel. 17, Last sequence update) 

DT 01-MAR-2003 (TrEMBLrel, 23, Last annotation update) 

DE Hypothetical protein. 

OS Homo sapiens (Human) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Primates; Catarrhini ; Hominidae; Homo. 

OX NCBI_TaxID=9606 ; 
RN . [1] 

RP SEQUENCE FROM N.A. 

RC TISSUE=Lymph; 

RA Strausberg R.; 

RL Submitted (MAR-2001) to the EMBL/GenBank/DDBJ databases. 

DR EMBL; BC004476; AAH04476.1; 

DR HSSP; P01857; 1FC1 . 

DR InterPro; IPR007110; Ig-like. 

DR InterPro; IPR003597; Ig_cl . 

DR InterPro; IPR003006; Ig_MHC. 

DR Pfam; PF00047; ig ; 3. 

DR SMART; SM00407; IGcl; 2. 

DR PROSITE; PS50835; IG_LIKE; 3. 

DR PROSITE; PS00290; IG_MHC; 3, 

KW Hypothetical protein. 

SQ SEQUENCE 375 AA; 41314 MW; B1A0A0998F473619 CRC64 ; 

Query Match 28.9%; Score 358; DB 4; Length 375; 

Best Local Similarity 34.1%; Pred. No. 1.5e-25; 

Matches 73; Conservative 48; Mismatches 85; Indels 8; Gaps 6; 

Qy 21 VFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFNWYVDGVEVHNAKTKPREEQYNST 8 0 

M M == :::: =:|hl | = = I I M III hi 

Db 146 VFAI PPS-FASIFLTKSTKLTCLVTDLTTYD-SVTI SWTRQNGEAVKTHTNI SESHPNAT 203 

Qy 81 YRWSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKAKGQP-REPQVYTLPPSRDEL 139 

: I :: II :|: : I h= Ihh^llh M I II MhhM 

Db 2 04 FSAVGEASICEDDWNSGERFTCTVTHTDLPSPLKQTISRPKGVALHRPDVYLLPPAREQL 263 

Qy 14 0 T-KNQVSLTCLVKGFYPSDIAVEWESNGQP--ENNYKTTPPvTjD--SDGSFFLYSKLTVD 194 

= -MM M hh hi III I h h : : I M M III 

Db 264 NLRESATITCLVTGFSPADVFVQWMQRGQPLSPEKYVTSAPMPEPQAPGRYFAHSILTVS 323 



Qy 195 KSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 228 

= I I -I I I I I I I h = = = I II 
Db 324 EEEWNTGETYTCWAHEALPNRVTERTVDKSTGK 357 



Search completed: January 16, 2004, 15:58:27 
Job time : 38 sees 



